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DESCRIPTION 

Interferon Alpha Plasmids And Delivery Systems, 
And Methods Of Making And Using The Same 

Related Applications 

This application relates to U.S. patent application 
Serial No. 08/949,160, filed October 10, 1997 and 
International patent application No. PCT/US97 / 18 7 7 9, filed 
October 10, 1997, (Lyon & Lyon Docket Nos . 226/285 US and 
PCT, respectively), both of which are related to U.S. patent 
application Serial No. 60/028,676, filed ^October 18, 1996, 
(Lyon & Lyon Docket No. 222/086 US), all three of which are 
entitled "IL-12 GENE EXPRESSION AND DELIVERY SYSTEMS AND 
USES" (by Nordstrom et al . ) . 

This application is also related to U.S. patent 
application Serial No. 08/798,974, filed February 11, 1997, 
(Lyon & Lyon Docket No. 224/084 US) and International patent 
application No. PCT/US95/17038 , filed December 28, 1995, 
(Lyon & Lyon Docket No. 210/190 PCT), both of which are 
related to U.S. patent application Serial No. 08/372,213, 
filed January 13, 1995, (Lyon & Lyon Docket No. 210/190 US) , 
all three of which are entitled " FORMULATED NUCLEIC ACID 
COMPOSITIONS AND METHODS OF ADMINISTERING THE SAME FOR GENE 
THERAPY" (by Mumper Rolland) . 

Each of the above-mentioned applications are 
incorporated herein by reference in their entirety, 
including any drawings. 

Field Of The Invention 

The present invention relates to gene delivery and gene 
therapy, and provides novel nucleic acid constructs for 
expression of interferon alpha in a mammal, formulations for 
delivery that incorporate a nucleic acid construct for 
expression, and methods for preparing and using such 
constructs and formulations. In particular, this invention 
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relates to plasmid constructs for delivery of therapeutic 
interferon alpha enco d i ng nucieic acids to cells in order to 
^ a " iVit ^ -"»od. of usi„ g those constructs 
(including combination therapy with other agents, such as 

cytokines, preferably IW2) . as weli as ^ * 

preparing such constructs. 


30 


Backgro und Of The Invention 

The following discussion of the background of the 
invention is me rely provided fco ^ ^ - 

understanding the invention and is not admitted to describe 
or constitute prior art to the present invention 


Plasmids are an important element in genetic 


""J 561,6 the "^- Plasmids are usually circular L 

15 :l s C : leS t that ^ " ^to bacterial cells by 

transformation which replicate autonomously in the cel / 
Plasmas typically aIlow for th . amplif ication Qf 
DNA Some plasraids are present in ^ ^ ^ ^ 

cell growth, and after the arrest of protein synthesis, a! 

20 :;:u y k r tT copies per cei1 of a m « - - 

^0 Suzuki et al.. Genetic Analysis, p. 404, 1989 

Current non-viral approaches to human gene therapy 
require that a potential therapeutic gene be cloned into 

the Tit - -H Lar9e qUantitleS ° f * *»<*„i.l "ost harboring 
the plasmid may be fermented and the plasmid D Nfi may bl 
" purified for subseguent use. Current human clinical trial! 
using plasmids utilize this approach. Recombinant DNA 
Advisory Committee Data Management Report, December, 1994 
t 7" ^ 6:535 " 548 - studies normally focus on the 

therapeutic gene and the elements that control its 
expression in the patient when designing and constructing 

rLulaJ ^ PlaSmidS - Gene " 11 >'' therapeutic genes and 
regulatory elements are simply inserted intQ ^ 

cloning vectors that are convenient and readily available 

35 fac, PlaSmid deSl9n and COnstructio " utilizes several key 
35 factors. First , plasmid repiicatiQn Qrigins ^ * 
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20 


30 


plasmid copy number, which affects production yields. 
Plasmids that replicate to higher copy number can increase 
plasmid yield from a given volume of culture, but excessive 
copy number can be deleterious to the bacteria and lead to 
5 undesirable effects (Fitzwater, et al., Embo J. 7:3289-3297 
(1988); Uhlin, et al., Mol. Gen. Genet. 165:167-179 (1979)). 
Artificially constructed plasmids are sometimes unstably 
maintained, leading to accumulation of plasmid-free cells 
and reduced production yields. 
10 To overcome this problem of plasmid-free cells, genes 

that code for antibiotic resistance phenotype are included 
on the plasmid and antibiotics are added 'to kill or inhibit 
plasmid-free cells. Most general purpose cloning vectors 
contain ampicillin resistance (^-lactamase, or bla) genes. 
15 Use of ' ampicillin can be problematic because of the 
potential for residual antibiotic in the purified DNA, which 
can cause an allergic reaction in a treated patient. In 
addition, p-lactam antibiotics are clinically important for 
disease treatment. When plasmids containing antibiotic 
resistance genes are used, the potential exists for the 
transfer of antibiotic resistance genes to a potential 
pathogen. 

Other studies have used the neo gene which is derived 
from the bacterial transposon Tn5 . The neo gene encodes 
25 resistance to kanamycin and neomycin (Smith, Vaccine 
12: 1515-1519 (1994)). This gene has been used in a number 
of gene therapy studies, including several human clinical 
trials (Recombinant DNA Advisory Committee Data Management 
Report, December, 1994, Human Gene Therapy 6:535-548). Due 
to the mechanism by which resistance is imparted, residual 
antibiotic and transmission of the gene to potential 
pathogens may be less of a problem than for p-lactams. 

In addition to elements that affect the behavior of the 
plasmid within the host bacteria, such as E. coli t plasmid 
vectors have also been shown to affect transfection and 
expression in eukaryotic cells. Certain plasmid sequences 


35 
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have been shown to reduce expression of eukarvoti 
eu.aryotic cel is when carried in cis ° Peter sZ et 
Cell. Biol. 7:l563-m« ,™. . .. . (Peterson ' et al . , Mol . 


Biol. 7:1563-1567 (1 987); Yoder 
Biol. 3:956-959 (1983); Lusky e t al N / ' ' ^ 

5 (1981); and Leite et al r o, ««t»re . 293:79-81 

sequences also have^ ^g' ' u (1989)) . P i asmid 

binding sites for L * " ^ *°^°^V contain 

et al r ° transcriptional control proteins (Ghersa 

et al., Gene 151:331-332 (1994); Tullv „«- , lwiersa « 
Siopnys. « es . Comra . 144:1 . (1987) TUUy H 6t al - 

io Endocrinol . 8:405 _ 407 1 ° 99 " 87,, Ih ;; d Kushner - et ai - 

P^ate 1 is of expression '> n t ^ ^ — - 

™ ^7..:^^^^ t that has 

^ents, in different delivery e Z^^"' 10 " ^ ° th " 

» certain leases, inclL„7 p.A'cSlr'"'^ 

International patent n „hi • par ticular cancers. 

January 3 1997 Dro PUbllCatl ° n "0/97/00085, published 

cells with "terfe ! ** ^ '""-'^lon of tumor 

-ecule, S ucraTt-T2. al rne a :; ^ ■ * 
20 treatments have proven e „r , Previously proposed 

approaches. Thus there o t n, . AVed ln ex vlvo 

p^ocj " ~ u •■ 

25 Summary 

30 expression, and methods for 7r paLr" J*" 1 " 1 ™* 

constructs and formulations. m'/ar S 1/ Ih/ 31 " 9 

relates to pi asmid constructs for delivery ^'^^ 

interferon alpha encoding nucleic acids to ,f therSpeUtic 

modulate tumor activitv h 8 ±n ° rder t0 

33 (-eluding combin^,^ ^ o^el ^ ~ S 

py with other agents, such as 
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cytokines, preferably IL-12), as well as methods for 
preparing such constructs. The pharmaceutical acceptable, 
cost effective and highly efficient delivery system 
presented herein represents an unanticipated improvement 
5 over the art. 

Thus, in a first aspect, the invention features a 
plasmid that contains a CMV promoter and optionally a 
synthetic 5' intron transcriptionally linked with an 
interferon alpha coding sequence, and a 3 ' -untranslated 
10 region (UTR) . Preferably the 3' UTR is a 3' growth hormone 
UTR. 

As used herein, the term "plasnfid" refers to a 
construct made up of genetic material (i.e., nucleic acids). 
It includes genetic elements arranged such that an inserted 

15 coding sequence can be transcribed in eukaryotic cells. 
Also, while the plasmid may include a sequence from a viral 
nucleic acid, such viral sequence does not cause the 
incorporation of the plasmid into a viral particle, and the 
plasmid is therefore a non-viral vector. Preferably a 

20 plasmid is a closed circular DNA molecule. 

"Cytomegalovirus promoter" refers to one or more 
sequences from a cytomegalovirus which are functional in 
eukaryotic cells as a transcriptional promoter and an 
upstream enhancer sequence. The enhancer sequence allows 

25 transcription to occur at a higher frequency from the 
associated promoter. 

In this context, "transcriptionally linked" means that 
in a system suitable for transcription, transcription will 
initiate under the direction of the control sequence (s) and 

30 proceed through sequences which are transcriptionally linked 
with that control sequence (s). Preferably no mutation is 
created in the resulting transcript, which would alter the 
resulting translation product. 

The term "coding region" or "coding sequence" refers to 
35 a nucleic acid sequence which encodes a particular gene 
product for which expression is desired, according to the 
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normal base pairing and codon usage relation k- 

t-h*. i ■ ' relationships. Thu<: 

the coding sequence must be placed in <,„m , 
transcriptional control .J^," "-^P to 

control elements and translating deluding 
5 termination codons, that a t and 

. Produce and wi ll result In' trl^ i ZZ ^ 
reading frame to produce a f unctional ^ ZZT^ 

in a preferred embodiment the interfp^n u 
sequence i «* ^ k interferon alpha coding 

nucleotide sequence of SEQ id NO- 11 or „ 6 
usage, such a «s t h» , semi -optimal codon 

ge such as the nucleotide sequence of SEQ ID NO ■ 12 

15 codino for h.,m=« ■ ^ natural sequences 

ng ror human interferon alpha Thi,« 


ID No!l2 * qUen " Sh ° Wn in SE Q 10 "0:11 or SEQ 

al ph a" h or "a" *° r the h ™ interferon 

Pfta or a human interferon aXpha coding sequence" is a 
nucleic acid sequence which encodes i-h. 

pr.f.r.M. tBat th . coalnj nM P « 1. 
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active as natural human interferon alpha. Thus, the 
polypeptide encoded by the interferon alpha coding sequence 
may differ from a natural human interferon alpha polypeptide 
by a small amount, preferably less than a 15%, 10%, 5%, or 
5 1% change. Such a change may be of one of more different 
types, such as deletion, addition, or substitution of one or 
more amino acids. 

The term ^transcriptional control sequence" refers to 
sequences which control the rate of transcription of a 
10 transcriptionally linked coding region. Thus, the term can 
include elements such as promoters, operators, and 
enhancers. For a particular transcription unit, the 

transcriptional control sequences will include at least a 
promoter sequence. 
15 a "growth hormone 3' untranslated region" is a sequence 

located downstream (i.e., 3' ) of the region encoding 
material polypeptide and including at least part of the 
sequence of the natural 3' UTR/poly(a) signal from a growth 
hormone gene, preferably the human growth hormone gene. 
20 This region is generally transcribed but not translated. 
For expression in eukaryotic cells it is generally 
preferable to include sequence which signals the addition of 
a poly-A tail. As with other synthetic genetic elements a 
synthetic 3* UTR/poly(A) signal has a sequence which differs 
25 from naturally-occurring UTR elements. 

The sequence may be modified, for example by the 
deletion of ALU repeat sequences. Deletion of such ALU 
repeat sequences acts to reduce the possibility of 
homologous recombination between the expression cassette and 
30 genomic material in a transfected cell. 

The plasmid preferably includes a promoter, a TATA box, 
a Cap site and a first intron and intron/exon boundary in 
appropriate relationship for expression of the coding 
sequence. The plasmid may also include a 5' mRNA leader 
35 sequence inserted between the promoter and the coding 
sequence and/or an intron/5* UTR from a chicken skeletal a- 


BNSDOCID: <WO 9947676A2 I » 


WO 99/47678 


PCT/US99/0S394 


8 

region, a first intron * fl - . -untranslated 

3-u ntranslated ~™ and * first 

intron is between the control wherein the f irst 

10 sequence; and , b) a aer „ S6qUenCe a " d the fi "t coding 

S econd tr an S : r r P Lo„ a s rrn t r; s iir ion unit cont — * 

linked with a second 5 ^: t \j;^™ transcriptional, 
i«ro„. a second coding ^ ' "j 910 "' 3 
-transited region/p oly (A) signal ^ tT^ ^ 
15 antron is between the control sequence and th SeC ° nd 
sequence; wherein the f irst and COdi " 9 

>° ^— - - - - rr::- - — 

r.*.r. T ^o t n u ^ d : rlPtl0n Unit " ^ ^ "ssette" 

-ding seqLnce alona "T^* " least <™ 

the in^atC and ter ™<* ^ 

^ transcription unit may noweter ^ °J r*'"^ * 
which „,a y inciude sequences nv Tl^T 
or Post-transiational processes In ^^^"'"•"iptional 
-e first transcriptio'na, ^^J^™-^"- 
transcriptional control e ~ q 6 ° r the second 

A « 5 . untr a nsl a ted region" or « 5 - UTR " . 
sequence lo ca ted 3- t-r, refers to a 

translation initiation codon 
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and therefore is not translated into a portion of the 
polypeptide product. 

For the plasmids described herein, one or more of a 
promoter, 5* untranslated region (5* UTR) , the 3' 
5 UTR/poly(A) signal, and introns may be a synthetic sequence. 
In this context the term "synthetic" means that the sequence 
is not provided directly by the sequence of a naturally 
occurring genetic element of that type but rather is an 
artificially created sequence (i.e., created by a person by 

10 molecular biological methods) . While one or more portions 
of such a synthetic sequence may be the same as portions of 
naturally occurring sequences, the full "sequence over the 
specified genetic element is different from a naturally 
occurring genetic element of that type.. The use of such 

15 synthetic genetic elements allows the functional 
characteristics of that element to be appropriately designed 
for the desired function. 

Thus, a "synthetic intron" refers to a sequence which 
is not a naturally occurring intron sequence but which will 

20 be removed from an RNA transcript during normal post 
transcriptional processing. Such introns can be designed to 
have a variety of different characteristics, in particular 
such introns can be designed to have a desired strength of 
splice site. 

25 A "subunit" of a therapeutic molecule is a polypeptide 

or RNA molecule which combines with one or more other 
molecules to form a complex having the relevant 
pharmacologic activity. Examples of such complexes include 
homodimers and heterodimers as well as complexes having 

30 greater numbers of subunits. A specific example of a 
heterodimer is IL-12, having the p40 and p35 subunits. 

The "p40 subunit" is the larger of the two subunits of 
the IL-12 heterodimer. Thus, it is capable of association 
with p35 subunit to form a molecule having activities 

35 characteristic of IL-12. Human p40 has the amino acid 
sequence of SEQ ID NO:l. Those skilled in the art will 
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c , uu poo. Such active => t r-^ ^ 

5 molecules are also regarded as P 40. altered 

Conversely, the " P 3S subunit" is the smaller of the two 
heterodyne subunits of IL - 12 . For humans , h « ^ 

10 s :ii e i a be 10W lGVel ^ alte — S '«« that sequence whi r 
iu still being regarded as p3S. 

in th A P " tiCUlar SXample of c °*™<3 regions suitable for use 
xn the plasmids of this invention are the natural sequences 
coding for the p 40 and p35 subunits of human I L - 12 ZT 

15 e n gi:ns Pr a e r f ; rred d . emb ° diment ^ — -din" 

regions are coding regions for those sequences anri 

preferably in the order P <0 then p 35 in' the 5 • 0 T 
direction. J 


25 


30 


35 


Thus, a 

IL-12 


IL 12- ■ Se ^ uence ^ing for the p40 subunit of human 

0 12 sub 1S .; nUClSiC aCid >**™" which encodes the human 
plirin , " d6SCribed abOVe ' on the normal base 

pairing and translational codon usage relationships. T he 
sequence coding f or P 35 subunit of human IL -1 2 is 

In a preferred embodiment the sequence coding for the 

the P 35 Un hUman IL " 12 ±S V t0 the - 

the p 35 subunit of human I L -i 2 . xhose skilled in « * 

will appreciate that the interferon alpha, P 35 subunit 
P subunit may all be on a single transcription unit, that 
any ITooT" ^ °" units, or tha 

and th Z 9 S6qUenCeS be °" °ne transcription unit 

and the other coding sequence on another transcription unit. 

soli, ld ^ alS ° COntai " a " intron h -ing variable 

splicing, a first coding sequence, and a second coding 
sequence, wherein the first and second coding sequence! 
include a sequence having the sequence of SEQ ID N0 -2 4 
or 25 coding for a human IX.-12 p 40 subunit, and a seguenc 
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having the sequence of SEQ ID NO: 6, 7, 8 or 24 coding for a 
human IL-12 p35 subunit. 

In a preferred embodiment, the plasmid also has: (a) a 
transcriptional control sequence transcriptionally linked 
5 with a first coding sequence and a second coding sequence; 
(b) a 5' -untranslated region; (c) an intron 5' to the first 
coding sequence; (d) an alternative splice site 3' to the 
first coding sequence and 5' to the second coding sequence; 
and (e) a 3 ' -untranslated region/poly (A) signal. The 

10 transcriptional control sequence preferably includes a 
cytomegalovirus promoter sequence. 

In a preferred embodiment, the plasmid also has: (a) a 
transcriptional control sequence transcriptionally linked 
with a first coding sequence, an IRES sequence, a second 

15 coding sequence, and a 3 1 -untranslated region/poly (A) 
signal, wherein the IRES sequence is between the first 
coding sequence and the second coding sequence; and (b) an 
intron between the promoter and the first coding sequence; 
wherein the first and second coding sequences include a 

20 sequence having the sequence of SEQ ID NO:2, 3, 4 or 25 
coding for a human IL-12 p4 0 subunit, and a sequence having 
the sequence of SEQ ID NO: 6, 7, 8 or 24 coding for a human 
IL-12 p35 subunit. The transcriptional control sequence 
preferably includes a cytomegalovirus promoter sequence and 

25 the IRES sequence preferably is from an encephalomyocarditis 
virus. 

For delivery of coding sequences for gene expression, 
it is generally useful to provide a delivery composition or 
delivery system which includes one or more other components 

30 in addition to the nucleic acid sequences. Such a 

composition can, for example, aid in maintaining the 
integrity of the DNA and/or in enhancing cellular uptake of 
the DNA and/or by acting as an immunogenic enhancer, such as 
by the non-DNA components having an immuno-st imulatory 

35 effect of their own. 


BNSDOCIO <WO S9J767BA2 I > 


WO 99/47678 


PCT/US99/05394 


12 


Thus, in another aspect t-ho • 

me pinc enhances the deliverv of +-k 
* -ol-cul. to mammalian cells in 1V ^ o ° f the »«^.ic acid 
nucleic acid molecule includes a dd P"^"bly the 

Product to be expressed TL ceil " 3 ^ 

relevant gene prod uct i s a nn ' ^ CaSes ' the 

Preferably the pinc „ Polypeptide or protein. 

io do es y n ot form c a rr s rt e h r at conditions s ° — «- 
« the time of ad^L^ri l;: is T r sent 

:r^,T:r ) si o t ;T;; s the I n pl i c t is — at — — - 

-I-C concentration I s . , r^." £ f "~ 7^ ^ 

» or less, 5% Qr less> 1 ; 8 " 1% f0 ; r -^«' 2 °* - ^ss, 

compositions differ in nom^ ^ Thus ' tnese 

effect from uses of the Se — ntration and funct i onaI 

compounds are used at h n" Slmllar C ° mP ° UndS in whi <=" the 
tne ethylene q i vcol 9 """"'""on., for example in 

- P^oplasL, or irfor^n^:: glT" °' ^ 
acid delivery. In general# fcne ^ S f ° r dru * or nucleic 

the conditions in 'which they are „ sZ ^ p" f0rm ^ 

-tain of tne compounds ^ ^ ^ ™s, though 
conditions. y m 9els und er some 

to an effect of t-h. • * ° r Protective" refers 

ect of the interaction between snr-h = 
a nucleic acid such that the rate of d « 
nucleic acid is decreased in degradation of the 

30 gradation may ^VVj^^ Such 
which specifically include th« dl "erent factors, 

nuclease. The Lot-* enzymatic action of a 

Afferent ,ay fo 1 T aCti ° n ^ >» Pr ° Vided 

-lecules or by ' excLToT " ^ eXClUSi ° n ° f th « 
^ uv exclusion of water. 

Some compounds which protect a „„„, ■ 
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interact or associate with the nucleic acid by 
intermolecular forces and/or valence bonds such as: Van der 
Waals forces, ion-dipole interactions, ion-induced dipole 
interactions, hydrogen bonds, or ionic bonds. These 
interactions may serve the following functions: (1) 
Stereoselectively protect nucleic acids from nucleases by 
shielding; (2) facilitate the cellular uptake of nucleic 
acid by "piggyback endocytosis" . Piggyback endocytosis is 
the cellular uptake of a drug or other molecule complexed to 
a carrier that may be taken up by endocytosis. CV Uglea and 
C Dumitriu-Medvichi, Medical Applications of Synthetic 
Oligomers, In: Polymeric Biomaterials , '"Severian Dumitriu 
ed., Marcel Dekker, Inc., 1993, incorporated herein by 
reference . 

To achieve the desired effects set forth it is 
desirable, but not necessary, that the compounds which 
protect the nucleic acid and/or prolong the bioavailability 
of a nucleic acid have amphiphilic properties; that is, have 
both hydrophilic and hydrophobic regions. The hydrophilic 
region of the compounds may associate with the largely ionic 
and hydrophilic regions of the nucleic acid, while the 
hydrophobic region of the compounds may act to retard 
diffusion of nucleic acid and to protect nucleic acid from 
nucleases . 

Additionally, the hydrophobic region may specifically 
interact with cell membranes, possibly facilitating 
endocytosis of the compound and thereby also of nucleic acid 
associated with the compound. This process may increase the 
pericellular concentration of nucleic acid. 

Agents which may have amphiphilic properties and are 
generally regarded as being pharmaceutical^ acceptable are 
the following: polyvinylpyrrolidones; polyvinylalcohols; 
polyvinylacetates; propylene glycol; polyethylene glycols; 
poloxamers (Pluronics) ; poloxamines (Tetronics); ethylene 
vinyl acetates; methylcelluloses, hydroxypropylcelluloses, 
hydroxypropylmethylcelluloses ; heteropolysaccharides 
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(pectins); chitosans; phosphatidylcholines (lecithins)- 
miglyols; polylactic acid; polyhydroxybutyric acid; xanthan 
gum. Also, copolymer systems such as polyethylene glycol- 
polylactic ac i d (PEG - PLA) , polyethylene glycol _ 

polyhydroxybutyric acid (PEG-phB) , polyvinylpyrrolidone- 
polyvinylalcohol (PVP-PVA), and derivatized copolymers such 
as copolymers of N-vinyl purine (or pyrimidine) derivatives 
and N-vinylpyrrolidone. However, not all of the above 
compounds are protective, interactive, non-condensing 
compounds as described below. 

In connection with the protective, interactive, non- 
condensing compounds for these compositions, the term "non- 
condensing" means that an associated nucleic acid is not 
condensed or collapsed by the interaction with the PINC at 
15 the concentrations used in the compositions. Thus, the 
PINCs differ in type and/or use concentration from such 
condensing polymers. Examples of commonly used condensing 
polymers include polylysine, and cascade polymers (spherical 
polycations) . 

20 Also in connection with such compounds and an 

associated nucleic acid molecule, the term "enhances the 
delivery" means that at least in conditions such that the 
amounts of P INC and nucleic acid is optimized, a greater 
biological effect is obtained than with the delivery of 
nucleic acid in saline. Thus, in cases where the expression 
of a gene product encoded by the nucleic acid is desired 
the level of expression obtained with the PINC: nucleic acid 
composition is greater than the expression obtained with the 
same quantity of nucleic acid in saline for delivery by a 
method appropriate for the particular PINC/coding sequence 
combination. 

In preferred embodiments of the above compositions, the 
PINC is polyvinyl pyrrolidone (PVP) , polyvinyl alcohol 
(PVA), a PVP-PVA co-polymer, N-methyl-2- P yrrolidone (NM2P) , 
ethylene glycol, or propylene glycol. m compositions 
which a Poloxamer (Pluronics) is used, the nucleic acid 
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preferably not a viral vector, i.e., the nucleic acid is a 
non-viral vector. 

In other preferred embodiments, the PINC is bound with 
a targeting ligand. Such targeting ligands can be of a 
variety of different types, including but not limited to 
galactosyl, residues, fucosal residues, mannosyl residues, 
carntitine derivatives, monoclonal antibodies, polyclonal 
antibodies, peptide ligands, and DNA-binding proteins. The 
targeting ligands may bind with receptors on cells such as 
antigen-presenting cells, hepatocytes, myocytes, epithelial 
cells, endothelial cells, and cancer cells. 

In connection with the association of a targeting 
ligand and a PINC, the term "bound with" means that the 
parts have an interaction with each other such that the 
15 physical association is thermodynamically favored, 
representing at least a local minimum in the free energy 
function for that association. Such interaction may involve 
covalent binding, or non-covalent interactions such as 
ionic, hydrogen bonding, van der Waals interactions, 
20 hydrophobic interactions, and combinations of such 
interactions . 

While the targeting ligand may be of various types, in 
one embodiment the ligand is an antibody. Both monoclonal 
antibodies and polyclonal antibodies may be utilized. 

25 The nucleic acid may also be present in various forms. 

Preferably the nucleic acid is not associated with a 
compounds (s) which alter the physical form, however, in 
other embodiments the nucleic acid is condensed (such as 
with a condensing polymer), formulated with cationic lipids, 

30 formulated with peptides, or formulated with cationic 
polymers . 

In preferred embodiments, the protective, interactive 
non-condensing compound is polyvinyl pyrrolidone, and/or the 
plasmid is in a solution having between 0.5% and 50% PVP, 
35 more preferably about 5% PVP. The DNA preferably is at 
least about 80% supercoiled, more preferably at least about 
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25 


30 


90% supercoiled, and most preferably at least about 95% 
supercoiled. 

In another aspect the invention features a composition 
containing a protective, interactive non-condensing compound 
and a plasmid containing an interferon alpha coding 
sequence . 

In yet another aspect, the invention provides a 
composition containing a plasmid of the invention (or a 
plasmid containing an interferon alpha coding sequence) and 
a cationic lipid with a neutral co-lipid. 

Preferably the cationic lipid is DOTMA and the neutral 
co-lipid is cholesterol (chol) . DOTMA is 1,2-di-O- 

octadecenyl-3-trimethylammonium propane, which is described 
and discussed in Eppstein et al., U.S. Patent 4,897,355 
15 issued January 30, 1990, which is incorporated herein by 
reference. However, other lipids and lipid combinations may 
be used in other embodiments. A variety of such lipids are 
described in Gao & Huang, 1995, Gene Therapy 2:110-122, 
which is hereby incorporated by reference. 
20 As the charge ratio of the cationic lipid and the DNA 

is also a significant factor, in preferred embodiments the 
DNA and the cationic lipid are present is such amounts that 
the negative to positive charge ratio is about 1:3. While 
preferable, it is not necessary that the ratio be 1:3. 
Thus, preferably the charge ratio for the compositions is 
between about 1:1 and 1:10, more preferably between about 
1:2 and 1:5. 

The term "cationic lipid" refers to a lipid which has a 
net positive charge at physiological pH/ and preferably 
carries no negative charges at such pH. An example of such 
a lipid is DOTMA. Similarly, "neutral co-lipid" refers to a 
lipid which has is usually uncharged at physiological pH . 
An example of such a lipid is cholesterol. 

Thus, "negative to positive charge ratio" for the DNA 
35 and cationic lipid refers to the ratio between the net 
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negative charges on the DNA compared to the net positive 
charges on the cationic lipid. 

As the form of the DNA affects the expression 
efficiency, the DNA preferably is at least about 80% 
5 supercoiled, more preferably at least about 90% supercoiled, 
and most preferably at least about 95% supercoiled. The 
composition preferably includes an isotonic carbohydrate 
solution, such as an isotonic carbohydrate solution that 
consists essentially of about 10% lactose. In preferred 

10 embodiments, the composition the cationic lipid and the 
neutral co-lipid are prepared as a liposome having an 
extrusion size of about' 800 nanometers/ Preferably the 
liposomes are prepared to have an average diameter of 
between about 20 and 800 nm, more preferably between about 

15 50 and 400 nm, still more preferably between about 75 and 
200 nm, and most preferably about 100 nm. Microf luidization 
is the preferred method of preparation of the liposomes. 

In another aspect the invention features a composition 
containing: (a) a first component having a plasmid including 

20 an interferon alpha coding sequence and a cationic lipid 
with a neutral co-lipid, wherein the cationic lipid is DOTMA 
and the neutral co-lipid is cholesterol, wherein the DNA in 
the plasmid and the cationic lipid are present in amounts 
such that the negative to positive charge ratio is about 

25 1:3; and (b) a second component including a protective, 
interactive non-condensing compound, wherein the first 
component is present within the second component. 

In another aspect, the invention provides a composition 
having a protective, interactive non-condensing compound, a 

30 first plasmid including an interferon alpha coding sequence, 
and one or more other plasmids independently having an IL-12 
p35 or IL-12 p40 subunit coding sequence. 

In another aspect, the invention features a method for 
making any of the plasmids described above by inserting a 

35 CMV promoter transcriptionally linked with an interferon 
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alpha coding sequence, and a growth hormone 3 • -untranslated 
region into a plasmid. 

The invention also provides methods of making the 
compositions described above. The method may involve- (a) 
preparing a DNA molecule having a transcriptional unit 
wherein the transcriptional unit contains an interferon 
alpha coding sequence; (b) preparing a protective 

interactive non-condensing compound; and (c) combining the 
protective, interactive non-condensing compound with the DNA 
in conditions such that a composition capable of delivering 
a therapeutically effective amount of an interferon alpha 
coding sequence to a mammal is formed. 

Preferably, the DNA molecule is a plasmid, wherein the 

15 StT* inCl r S 3 ^ Pr ° m0ter tr —iPtionally linked 
with an interferon alpha coding sequence, and a human growth 
hormone 3 ■ -untranslated region/poly (A) signal. 

The method may involve the steps of: (a) preparing a 
haVlng an interferon alpha coding sequence; (b) 
preparing a mixture of a cationic lipid and a neutral co- 
lipid, wherein the cationic lipid is DOTMA and the neutral 
co-li P i d is cholesterol; and (c) combining the mixture with 
the DNA in amounts such that the cationic lipid and the DNA 
are present in a negative to positive charge ratio of about 

25 In another embodiment, the method involves the steps 

of: (a) preparing a first component having a plasmid 
containing an interferon alpha coding sequence and a 
cationic lipid with a neutral co-li p i d , wherein the cationic 
lipid is DOTMA and the neutral co-lipid is cholesterol, 

30 wherein the DNA in the plasmid and the cationic lipid are 
present in amounts such that the negative to positive charge 
ratio is about 1:3; (b) preparing a second component having 
a protective, interactive non-condensing compound; and (c) 
combining the first and second components such that the 
resulting composition includes the first component within 
the second component. 
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In another embodiment, the method involves the steps 
of: (a) preparing a protective, interactive non-condensing 
compound, (b) preparing a first plasmid having an interferon 
alpha coding sequence, (c) preparing one or more other 
5 plasmids independently having an IL-12 p35 or IL-12 p4 0 
subunit coding sequence, and (d) combining the protective, 
interactive non-condensing compound, the plasmid having the 
interferon alpha coding sequence and the other plasmids. 

In another aspect, the invention provides a method for 
10 treatment of a mammalian condition or disease, by 
administering to a mammal suffering from the condition or 
disease a therapeutically effective amount of a plasmid as 
described herein. 

A ^therapeutically effective amount" of a composition 
15 is an amount which is sufficient to cause at least temporary 
relief or improvement in a symptom or indication of a 
disease or condition. Thus, the amount is also sufficient 
to cause a pharmacological effect. The amount of the 
composition need not cause permanent improvement or 
20 improvement of all symptoms or indications. A 
therapeutically effective amount of a cancer therapeutic 
would be one that reduces overall tumor burden in the case 
of metastatic disease (i.e., the number of metasteses or 
their size) or one that reduces the mass of the tumor in 
25 localized cancers. 

The condition or disease preferably is a cancer, such 
as epithelial glandular cancer, including adenoma and 
adenocarcinoma; squamous and transitional cancer, including 
polyp, papilloma, squamous cell and transitional cell 
30 carcinoma; connective tissue cancer, including tissue type 
positive, sarcoma and other (oma's); hematopoietic and 
lymphoreticular cancer, including lymphoma, leukemia and 
Hodgkin's disease; neural tissue cancer, including neuroma, 
sarcoma, neurofibroma and blastoma; mixed tissues of origin 
35 cancer, including teratoma and teratocarcinoma. Other 
cancerous conditions that are applicable to treatment 
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include cancer of any of the following: adrenal gland, anus, 
bile duct, bladder, brain tumors: adult, breast, cancer of 
an unknown primary site, carcinoids of the gastrointestinal 
tract, cervix, childhood cancers, colon and rectum, 
5 esophagus, gall bladder, head and neck, islet cell and other 
pancreatic carcinomas, Kaposi's sarcoma, kidney, leukemia, 
liver, lung: non-small cell, lung: small cell, lymphoma: 
AIDS-associated, lymphoma: Hodgkin's disease, Lymphomas: 
non-Hodgkin' s disease, melanoma, mesothelioma, metastatic 

10 cancer, multiple myeloma, ovary, ovarian germ cell tumors, 
pancreas, parathyroid, penis, pituitary, prostate, sarcomas 
of bone and soft tissue, skin, small intestine, stomach, 
testis, thymus, thyroid, trophoblastic disease, uterus: 
endometrial carcinoma, uterus: uterine sarcomas, vagina, or 

15 vulva. The composition preferably is administered by 

injection, although the method may also be performed ex 
vi vo. 

In another aspect, the invention provides a method for 
transfection (i.e., the delivery and expression of a gene to 

20 cells) of a cell in situ, by contacting the cell with a 
plasmid described herein for sufficient time to transfect 
the cell. Transfection of the cell preferably is performed 
in vivo and the contacting preferably is performed in the 
presence of about 5% PVP solution. 

25 In another aspect, the invention features a method for 

delivery and expression of an interferon alpha gene in a 
plurality of cells, by: (a) transfecting the plurality of 
cells with a plasmid or composition of the invention; and 
(b) incubating the plurality of cells under conditions 

30 allowing expression of a nucleic acid sequence in the 
vector, wherein the nucleic acid sequence encodes interferon 
alpha . 

In preferred embodiments, the interferon alpha is human 
interferon alpha and the cells are human cells and/or the 
35 contacting is performed in the presence of an about 5% PVP 
solution . 
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In another aspect, the invention features a method for 
treating a disease or condition, by transfecting a cell in 
situ with a plasmid or composition of the invention. The 
disease or condition can be a localized disease or condition 
5 or a systemic disease or condition. 

In another aspect, the invention features a cell 
transfected with a plasmid or composition of the invention. 

In yet another aspect, the invention features a method 
for treatment of a mammalian condition or disease, by 
10 administering to a mammal suffering from the condition or 
disease a therapeutically effective amount of a composition 
described herein. 

As the compositions are useful for delivery of a 
nucleic acid molecule to cells in vivo, in a related aspect 
15 the invention provides a composition at an in vivo site of 
administration. In particular this includes at an in vivo 
site in a mammal. 

In preferred embodiments the nucleic acid molecule 
includes a sequence encoding a gene product. Also' in 
20 preferred embodiments, the site of administration is in an 
interstitial space or a tissue of an animal, particularly: of 
a mammal. 

The invention also provides methods for using the above 
compositions. Therefore, in further related aspects, 

25 methods of administering the compositions are provided in 
which the composition is introduced into a mammal, 
preferably into a tissue or an interstitial space. 

Various methods of delivery may be utilized, such as 
are known in the art, but in preferred embodiments, the 
composition is introduced into the tissue or interstitial 
space by injection. The compositions may also be delivered 
to a variety of different tissues, but in preferred 
embodiments the tissue is muscle or tumor. 

In another related aspect, the invention provides 
35 methods for treating a mammalian condition or disease by 
administering a therapeutically effective amount of a 
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composition as described above. In preferred embodiments, 
the disease or condition is a cancer. 

The summary of the invention described above is non- 
limiting and other features and advantages of the invention 
5 will be apparent from the following detailed description of 
the preferred embodiments, as well as from the claims. 

Brief Description Of The Drawings 

Figure 1 shows the effects of interferon alpha in two 
cancer models. 

10 Figure 2 shows a plasmid map and sequence (SEQ ID 

NO: 18) for an exemplary IL-12 plasmid of the present 
invention. 

Figure 3 shows optimal codon usage for highly expressed 
human genes . 

15 Figure 4 shows a plasmid map and sequence (SEQ ID 

NO: 19) for plasmid pIF0836, an exemplary interferon alpha 
plasmid of the present invention. 

Figure 5 shows a plasmid map and sequence (SEQ ID 
NO: 20) for pIN096, an exemplary IL-12 plasmid that can be 
20 used with the present invention. 

Figure 6 shows the nucleic acid sequence (SEQ ID NO:21) 
of plasmid pIF0921, an exemplary interferon alpha plasmid of 
the present invention. 

Figures 7A and 7B show a plasmid map and sequence (SEQ 
25 ID NO:22) for plasmid pIF0921. 

Figure 8 shows an outline of a strategy that can be 
used to synthesize a pIF0921 plasmid. 

Figure 9 shows interferon alpha and IL-12 gene medicine 
(combination therapy) in Renca model. 

30 Detailed Description Of The Preferred Embodiments 

The present invention relates to gene delivery and gene 
therapy, and provides novel nucleic acid constructs for 
expression of interferon alpha in a mammal, formulations for 
delivery that incorporate a nucleic acid construct for 
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expression, and methods for preparing and using such 
constructs and formulations. In particular, this invention 
relates to plasmid constructs for delivery of therapeutic 
interferon alpha encoding nucleic acids to cells in order to 
5 modulate tumor activity, methods of using those constructs 
(including combination therapy with other agents, such as 
cytokines, preferably IL-12) , as well as methods for 
preparing such constructs. 

I . General 

10 As described, this invention concerns expression 

systems for the delivery and expression off interferon alpha 
coding sequences in mammalian cells, and formulations and 
methods for delivering such expression systems or other 
expression systems to a mammal. 
15 Therefore, particular genetic constructs are described 

which includes nucleotide sequences coding for interferon 
alpha, preferably human interferon alpha. Such a construct 
can beneficially be formulated and administered as described 
herein, utilizing the expression systems of this invention. 
20 To allow convenient production of such plasmids, it is 

generally preferable that the plasmid be capable of 
replication in a cell to high copy number. Generally such 
production is carried out in prokaryotic cells, particularly 
including Esherichia coli (E.coli) cells. Thus, the plasmid 
25 preferably contains a replication origin functional in a 
prokaryotic cell, and preferably the replication origin is 
one which will direct replication to a high copy number. 

It is also possible to utilize synthetic genetic 
elements in the plasmid constructs. 
30 As described below, these elements affect post- 

transcriptional processing in eukaryotic systems. Thus, the 
use of synthetic sequences allows the design of processing 
characteristics as desired for the particular application. 
Commonly, the elements will be designed to provide rapid and 
35 accurate processing. 
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For delivery of DNA encoding a desired expression 
product to a mammalian system, it is usually preferable to 
utilize a delivery system. Such a system can provide 
multiple benefits, notably providing stabilization to 
5 protect the integrity of the DNA, as well as assisting in 
cellular uptake. 

In addition, the non-DNA components of the formulation 
may contribute to an immune system enhancement or 
activation. As a result, components of a delivery system 
10 can be selected in conjunction with a particular gene 
product to enhance or minimize the immuno-stimulatory 
effect. - 

The plasmids may also include elements for expression 
of IL-12 or one or more subunits thereof. Similarly, the 
15 treatment may involve administration of an interferon alpha 
coding sequence and one or more IL-12 coding sequences 
whether on a single plasmid or on separate plasmids. Such 
Plasmids may be incorporated into compositions for delivery 
with a protective, interactive non-condensing compound, a 
cationic lipid and neutral co-lipid, or both. 

While these are specific effective examples, other 
components may be utilized in formulations containing the 
interferon alpha expression vectors of the present invention 
to provide effective delivery and expression of interferon 
alpha or with other coding sequences for which manipulation 
of the activation of immune system components is desirable. 

The selection of delivery system components and 
preparation methods in conjunction with the selection of 
coding sequences provides the ability to balance the 
specific effects of the encoded gene products and the immune 
system effects of the overall delivery system composition. 
This capacity to control the biological effects of delivery 
system formulation administration represents an aspect of 
the invention in addition to the interferon alpha encoding 
constructs and specific formulations of delivery systems 
Co-selection of the encoded gene product and the delivery 
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system components and parameters provides enhanced desired 
effects rather than merely providing high level gene 
expression. In particular, such enhancement is described 
below for the antitumor effects of the exemplary PVP 
containing compositions. 

For systems in which IL-12 is also administered, the 
antitumor effect can be greater than merely additive {i.e., 
greater than merely the sum of the antitumor effects of 
interferon alpha alone and IL-12 alone) . Enhancement of 
immuno-stimulatory effects is also desirable in other 
contexts, for example, for vaccine applications. 

In contrast, for certain applications, it is preferable 
to select a delivery systems which minimizes the immune 
system effects. For example, it is often preferred that the 
immune system activation be minimized for compositions to be 
delivered to the lung in order to minimize lung tissue 
swelling. 

A useful approach for selecting the delivery system 
components and preparation techniques for a particular 
coding sequence can proceed as follows, but is not limited 
to these steps or step order. 

1- Select a particular genetic construct which 
provides appropriate expression in vitro. 

2. Select delivery system components based on desired 
immunostimulatory effects and/or on in vivo 
physiological effect. Such effects can be tested 
or verified in in vivo model systems. 

3. Select the other delivery system parameters 
consistent with the desired immuno-stimulatory 
effects and/or consistent with enhancing the 
desired in vivo physiological effect. Such 
parameters can, for example, include the amount 
and ratio of DNA to one or more other composition 
components, the relative amounts of non-DNA 
composition components, the size of delivery 
system formulation particles, the percent 
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supercoiled DNA for circular dsDNA vectors, and 
the specific method of preparation of delivery 
system formulation particles. The particular 
parameters relevant for specific types of 
5 formulations will be apparent or readily 

determined by testing. 
The description below illustrates the selection of 
components and parameters in the context of interferon alpha 
encoding constructs. However, it should be recognized that 
10 the approach is applicable to constructs containing a 
variety of other coding sequences. 

11 • Plasmid Construct Expression Systems 
A - Plasmid Design and Construction 

For the methods and constructs of this invention, a 
15 number of different plasmids were constructed which are 
useful for delivery and expression of sequences encoding 
interferon alpha. Thus, these plasmids contain coding 
regions for interferon alpha, along with genetic elements 
necessary or useful for expression of those coding regions. 

While these embodiments utilized interferon alpha cDNA 
clones or partial genomic sequences from a particular 
source, those skilled in the art could readily obtain 
interferon alpha coding sequences from other sources, or 
obtain a coding sequence by identifying a cDNA clone in a 
library using a probe (s) based on the published interferon 
alpha coding and/or flanking sequences. This also applies 
to the 1L-12 coding sequences utilized in the embodiments 
described herein. 

Coding sequences for interferon alpha were incorporated 
30 ln to an expression plasmid that contains eukaryotic and 
bacterial genetic elements. Eukaryotic genetic elements 
include the CMV immediate early promoter and 5' UTR, and a 
human growth hormone 3' UTR/poly(a) signal, which influence 
gene expression by controlling the accuracy and efficiency 
35 of RNA processing, mRNA stability, and translation. 
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The human growth hormone 3' UTR is from a human growth 
hormone gene, and preferably includes a poly (a) signal. 
This sequence can be linked immediately following the 
natural translation termination codon for a cDNA sequence, 
5 genomic sequence, modified genomic sequence, or synthetic 
sequence coding for interferon alpha. 

An example of a human growth hormone 3' UTR/poly(a) 
signal is shown by the human growth hormone 3' UTR 
(nucleotides 1 - 100) and 3' flanking sequence (nucleotides 
10 101 - 191; GenBank accession #J03071, HUMGHCSA) below. 
(SEQ ID NO: 13) 


15 


30 


35 


1 


51 


GGGTGGCATCCCTGTGACCCCTCCCCAGTGCCTCTCCTGGCCCTGGAAGT 

Poly (a) signal 


TGCCACTCCAGTGCCCACCAGCCTTGTCCT AATAAA ATTAAGTXGCATCA 
101 TTTTGTCTGACTAGGTGTCCTTCTATAATATTATGGGGTGGAGGGGGGTG , 


20 151 


GTATGGAGCAAGGGGCAAGTTGGGAAGACAACCTGTAGGGC 
The 5' and 3' UTR and flanking regions can be further 
and more precisely defined by routine methodology, e.g., 
deletion or mutation analysis or their equivalents., and can 
be modified to provide other sequences having appropriate 
25 transcriptional and translational functions. 

1. Construction of plasmid: Plasmid Backbone, human 
interferon alpha cDNA, Final Construct 

A diagrammatic representation of the PCR products and 
plasmids involved in creation of an exemplary construct is 
shown below in Figure 8. 

Plasmid pIF0921 was constructed from commercially 
available plasmids, and contains the TN5 gene encoding the 
kanamycin resistance gene, the pUC origin of replication, 
the CMV enhancer and promoter to base +112, a synthetic 
intron called IVS8, the human IFN-a2b gene, and the human 
growth hormone 3' UTR. The plasmid construction descendancy 
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for pIL0697 is shown in Figure 8. pIL0697 was cut with 
BamHI and Xba I and the hIFN-a2b PCR product, which had been 
amplified from human genomic DNA with BamHI and Xba I ends, 
was cloned into the pIL0697 backbone in place of the IL-2 
5 coding region. The resulting plasmid was pIF0863. pIF0863 
was cut with Nco I and intron IVS8 from pCT0828 was cloned 
in. The resulting plasmid was pIF0890. pIF0890 was cut 
with Nde I and Pac I and an additional region of the CMV 5' 
UTR to base +112 was cloned in from plasmid pLC0888. 

10 B - Synthetic Genetic Elements 

In some embodiments, some or all of the genetic 
elements can be synthetic, derived from synthetic 
oligonucleotides, and thus are not obtained directly from 
natural genetic sequences. These synthetic elements are 
appropriate for use in many different expression vectors. 

A synthetic intron is designed with splice sites that 
ensure that RNA splicing is accurate and efficient. A 
synthetic 3' UTR/poly(A) signal is designed to facilitate 
mRNA 3' end formation and mRNA stability. A synthetic 5* 
UTR is designed to facilitate the initiation of translation. 
The design of exemplary synthetic elements is described in 
more detail below. 

1 - Summary of Synthetic Element Features 
Exemplary synthetic 5 'UTR, intron, and 3 • UTR/poly (A) 
25 signal have the general features shown below: 

5' UTR Short. 

Lack of secondary structure. 

Kozak sequence. 

Site for intron insertion. 
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Intron 5* splice site sequence matches 

consensus . 

5 1 splice site sequence is exactly 
complementary to 5 1 end of Ul 
snRNA. 

Branch point sequence matches 
consensus . 

Branch point sequence is 
complementary to U2 snRNA. 
3* splice site matches consensus. 
Polypyrimidine tract is 16 bases in 
length and contains 7 -consecutive 
T's. (The tract preferably 
contains at least 5 consecutive 
T" s. ) 

Contains internal restriction 
enzyme sites. 

Bbsl cleaves at the 5'ss, Earl 
cleaves at the 3'ss. 

3* UTR/Poly(A) Based on rabbit p-globin 3' 

UTR/poly (A) signal . 

Consists of two poly (A) signals in 
tandem. 


2 . Features of the Synthetic 5 * UTR (UT6) : 
The 5' untranslated region (5' UTR) influences the 
translational efficiency of messenger RNA, and is therefore 
an important determinant of eukaryotic gene expression. The 
synthetic 5 ' UTR sequence (UT6) has been designed to 
maximize the translational efficiency of mRNAs encoded by 
vectors that express genes of therapeutic interest. 

The sequence of the synthetic 5' UTR (UT6) is shown 
below. The Kozak sequence is in boldface and the initiation 
codon is double underlined. The location of the intron 
(between residues 48 and 49) is indicated by the filled 
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10 


triangle and the sequences that form the exonic portion of 
consensus splice sites are single underlined. The 
restriction sites for Hindlll and Ncol are overlined. (SEQ 
ID NO:14) 

HindIJI V Ncol 

AAGCTTACTCAACACAATAACAAACTTACTTACAATCTTAATTAACAGGCCACCATGG 

The 5' untranslated region (5' UTR), located between 
the cap site and initiation codon, is known to influence the 
efficiency of mRNA translation. Any features that influence 
the accessibility of the 5' cap structure to initiation 
factors, the binding and subsequent migration of the 43S 
preinitiation complex, or the recognition of the initiation 
15 codon, will influence mRNA translatability . An efficient 5* 
UTR is expected to be one that is moderate in length, devoid 
of secondary structure, devoid of upstream initiation 
codons, and has an AUG within an optimal local context 
(Kozak, 1994, Biochimie 76:815-821; Jansen et al., 1994). A 
20 5' UTR with these characteristics should allow efficient 
recognition of the 5' cap structure, followed by rapid and 
unimpeded ribosome scanning by the ribosome, thereby 
facilitating the translation initiation process. 

The sequence of the synthetic 5 'UTR was designed to be 
25 moderate in length (54 nucleotides (nts) ) , to be deficient 
in G but rich in C and A residues, to lack an upstream ATG, 
to place the intended ATG within the context of a optimal 
Kozak sequence (CCACCATGG) , and to lack potential secondary 
structure. The synthetic 5' UTR sequence was also designed 
30 to lack AU-rich sequences that target mRNAs to be rapidly 
degraded in the cytoplasm. 

Experiments have demonstrated that introns increase 
gene expression from cDNA vectors, and that introns located 
in the 5' UTR are more effective than ones located in the 3' 
35 UTR (Huang and Gorman, 1990, Mol. Cell. Biol. 10:1805-1810; 
Evans and Scarpulla, 1989, Gene 84:135-142; Brinster et al., 
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1988, Proc. Natl. Acad. Scl. USA 85:836-840; Palmiter et 
al., 1991, Proc. Natl. Acad. Sci . USA 88:478-482; Choi et 
al., 1991, Mol. Cell. Biol. 11:3070-3074). Accordingly, the 
synthetic 5* UTR sequence was designed to accommodate an 
5 intron with consensus splice site sequences. The intron 
may, for example, be located between residues 4 8 and 4 9 (See 
intron sequence structure below). The CAG at position 46-48 
is the exonic portion of a consensus 5' splice site. The G 
at position 4 9 is the exonic portion of a consensus 3* 
10 splice site . 

To facilitate cloning manipulations, the synthetic 5' 
UTR sequence was designed to begin with a Hindlll site and 
terminate with a Ncol site. 

3. Features of the Synthetic Intron 
15 RNA splicing is required for the expression of most 

eukaryotic genes. For optimal gene expression, RNA splicing 
must be highly efficient and accurate. A synthetic intron, 
termed OPTIVS8B, was designed to be maximally efficient and 
accurate. 

20 The structure of the exemplary synthetic intron, 

OPTIVS8 is shown below. Sequences for the 5' splice site 
(5'ss), branch point (bp), and 3' splice site (3'ss) are 
double underlined. The recognition sequences for the 
restriction enzymes Bbsl and Earl are overlined. The 

25 cleavage site for Bbsl corresponds to the 5'ss, and the 
cleavage site for Earl corresponds to the 3'ss.. 

5'ss bp 3'ss 

I Bbsl | Earl | 

30 5* CAG GTAAGT GTCTTC (77) TACTAAC GG TTCTTTTTTTCTCTTCACAG G 3' 

(SEQ ID NO. 15) (SEQ ID NO. 16) 

The 5* splice site (5'ss) sequence matches the 
established consensus sequence, MAG □ GTRAGT, where M = C or 
A, and R = G or A. Since the mechanism of splicing involves 
35 an interaction between the 5'ss of the pre-mRNA and Ul 
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snRNA, the 5 ■ ss sequence of OPTIVS8B was chosen to be 
exactly complementary to the 5' end of Ul snRNA. 

5*ss 5' CAGGUAAGU 3* 

I It I I f I I | 
Ul RNA 3* GUCCAUUCA 5* 

In mammals, the consensus sequence for branch points 
(YNYTRAY, where Y = C or T, R = A or G, N = any base, and 
the underlined A residue is the actual branch point) is very 
ambiguous. Since the mechanism of splicing involves an 
interaction between the branch point (bp) of the pre-mRNA 
and U2 snRNA, the branch point sequence of OPTIVS8B was 
chosen to maximize this interaction. (Note that the branch 
point itself is bulged out). The chosen sequence also 
matches the branch point sequence that is known to be 
obligatory for pre-mRNA splicing in yeast. The branch point 
is typically located 18-38 nts upstream of the 3' splice 
site. in OPTIVS8B, the branch point is located 24 nts 
upstream from the 3' splice site. 

BP 5' UACUA£C 3' 

I I I I I f 
U2 RNA 3' AUGAU G 5' 

The sequence of the 3' splice site (3'ss) matches the 
established consensus sequence, Y X1 NYAG I G, where Y = C or 
T, and N = any base. In 3' splice sites, the polypyrimidine 
tract (Y n ) is the major determinant of splice site strength. 
For optimal splice site function in OPTIVS8B, the length of 
the polypyrimidine tract was extended to 16 bases, and its 
sequence was adjusted to contain 7 consecutive T residues. 
This feature was included because Roscigno et al . (1993) 
demonstrated that optimal splicing requires the presence of 
at least 5 consecutive T residues in the polypyrimidine 
tract. 
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Splicing in vitro is generally optimal when introns are 
>80 nts in length (Wieringa, et al., 1984; Ulfendahl et al., 
1985, Nucl. Acids Res. 13:6299-6315). Although many introns 
may be thousands of bases in length, most naturally 
5 occurring introns are 90-200 nt in length (Hawkins, 1988, 
Nucl. Acids Res. 16:9893-9908). The length of the 
synthetic intron (118 nts) falls within this latter range. 

OPTIVS8B was designed with three internal restriction 
enzyme sites, Bbsl, Nhel, and Earl. These restriction sites 

10 facilitate the screening and identification of genes that 
contain the synthetic intron sequence. In addition, the 
Bbsl and Earl sites were placed so that their cleavage sites 
exactly correspond to the 5'ss (Bbsl) or 3'ss (Earl). The 
sequence of the polypyrimidine tract was specifically 

15 designed to accommodate the Earl restriction site. 
Inclusion of the Bbsl and Earl sites at these locations is 
useful because they permit the intron to be precisely 
deleted from a gene. They also permit the generation of an 
"intron cassette" that can be inserted at other locations 

20 within a gene. 

The 77 bases between the Bbsl site and the branch point 
sequence are random in sequence, except for the inclusion of 
the Nhel restriction site. 


4. Features of the Synthetic 3* UTR/poly(A) 

25 Signal : 

The 3' ends of eukaryotic mRNAs are formed by the 
process of polyadenylation . This process involves site 
specific site RNA cleavage, followed by addition of a 
poly (A) tail. RNAs that lack a poly (A) tail are highly 

30 unstable. Thus, the efficiency of cleavage/polyadenylation 
is a major determinant of mRNA levels, and thereby, of gene 
expression levels. 2XPA1 is a synthetic sequence, containing 
two efficient poly (A) signals, that is designed to be 
maximally effective in polyadenylation. 
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A poly (A) signal is required for the formation of the 
3' end of most eukaryotic mRNA. The signal directs two RNA 
processing reactions: site-specific endonucleolytic cleavage 
of the RNA transcript, and stepwise addition of adenylates 
5 (approximately 250) to the newly generated 3- end to form 
the poly (A) tail. a poly (A) signal has three parts - 
hexanucleotide, cleavage site, and downstream element. The 
hexanucleotide is typically AAUAAA and cleavage sites are 
most frequently 3- to the dinucleotide CA (Sheets et al . , 
10 1987). Downstream elements are required for optimal poly (A) 
signal function and are located downstream of the cleavage 
site. The sequence requirement for downstream elements is 
not yet fully established, but is generally viewed as OG- or 
U-rich sequences (Wickens, 1990; Proudfoot, 1991 Cell 
15 64:671-674; Wahle, 1992, Bioessays 14:113-118; Chen and 
Nordstrom, 1992, Wucl . Acids Res. 20:2565-2572). 

Naturally occurring poly (A) signals are highly variable 
in their effectiveness (Peterson, 1992). The effectiveness 
of a particular poly (A) signal is mostly determined by the 
quality of the downstream element. (Wahle, 1992). m 
expression vectors designed to express genes of therapeutic 
interest, it is important to have a poly (A) signal that is 
as efficient as possible. 

Poly (A) efficiency is important for gene expression, 
25 because transcripts that fail to be cleaved and 
polyadenylated are rapidly degraded in the nuclear 
compartment. In fact, the efficiency of polyadenylation in 
liv ln g cells is difficult to measure, since 
nonpolyadenylated RNAs are so unstable. In addition to 
being required for mRNA stability, poly (A) tails contribute 
to the translatability of mRNA, and may influence other RNA 
processing reactions such as splicing or RNA transport 
((Jackson and Standart, 1990, Cell 62:15-24; Wahle, 1992). 

Some eukaryotic genes have more than one poly (A) site, 
implying that if the cleavage/polyadenylation reaction fails 
to occur at the first site, it will occur at one of the 


20 


30 


35 
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later sites. In COS cell transfection experiments, a gene 
with two strong poly (A) sites yielded approximately two-fold 
more mRNA than one with a single strong poly (A) site 
(Bordonaro, 1995) . These data suggest that a significant 
5 fraction of transcripts remain unprocessed even with a 
single "efficient" poly (A) signal. Thus, it may be 

preferable to include more than one poly (A) site. 

The sequence of the exemplary synthetic poly (A) signal 
is shown below. The sequence is named 2XPA. The 

10 hexanucleotide sequences and downstream element sequences 
are double underlined, and the two poly (A) sites are labeled 
as pA#l and pA#2 . Convenient restriction sites are 

overlined. The entire 2XPA unit may be transferred in 
cloning experiments as a Xbal-Kpnl fragment. Deletion of 

15 the internal BspHI fragment results in the formation of a 
1XPA unit. (SEQ ID NO. 17) 

Xbal BspHI 

TCTAGAGCATTTTTCCCTCTGCCAAAAATTATGGGGACATCATGAAGCCCCTTGAGCATCTGACG 
20 pA#l 

Hex I Downstream element 

TCTGGCT AATAAA GGAAATTTATTTTCATTGCAATA GTGTGTTGGAATTTTTTGTGT CTCTCACT 

BspHI 

25 CGG T AC T AG AGC AT T T T T CC C TC T GC C AAAAAT T AT GGGG AC AT CAT G AAGCC C C T T G AGC AT C T 

pA#2 

Hex I Downstream element 

G AC GT CT GGCT AAT AAA G G AAAT T T ATT T T CAT T GC AAT AGT GT GT T GG AAT TTTTTGTGT CTCT 

30 Kpnl 
CACTCGGTACC 

The sequence of the synthetic poly (A) site shown above 
is based on the sequence of the rabbit D-globin poly (A) 
35 signal, a signal that has been characterized in the 
literature as strong (Gil and Proudfoot, 1987, Cell 49:399- 
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406; Gil d p roudfoQt/ 1984> 312;473 _ 47 

k ey features is the structure of its downstream element 
which contains both UG- and 0-rich domains 

A double-stranded DNA sequence corresponding to the 


1XPA sequence 


" constructed from synthetic 

3tT ideS - C ° PieS ° f 1XPA — t"en 

oxned to form the 2XPA sequence. The sequences were joined 
» such as way as to have a un i que xbal site at the 5- end 
of the first poly(A) signal containing fragment and ° 

Z^^ f Site at the 3> end of the - iiy«> sCa 

containing fragment. 


10 


^ I" te ^o n Alpha and IT-1? r^ n ^-p,n^, 

alpha cod"" 160 "" 6 SeqUenCe ° f 3 natUral hUman interferon 
15 !i SSqUenCes is kn °™< and is provided below 

te n r 9 fe W ron ^ ^ '~ h ~ 

interferon alpha. The same applies with respect to the 
IL-12 coding sequences. 

for int'T CaSGS ' inSt6ad ° f natUral S ^ UenCe coding 

20 f ° r t ./ nterfer0n ^a, it is advantageous to utilij 

sT he e t 1C SeqU6nCeS ^ 

nal al s SeqUenCeS ^ • lt «™ t - *r«n the 

"cleotide qUenCS ' thUS "«"y different 

nucleotide sequences from the natural sequence m 

25 ZTT! SV T tiC SeqUenCSS ^ USed Which »»- codon 

usage at least partially optimized for expression in a 

human. The natural sequences do not have such' optimal codo 

:;;iLed. ferabiy ' — - - e a re 

Optimal codon usage in humans is indicated by codon 
30 usage frequencies for highly expressed h„m a „ 

• y ly ex P r essed human genes, as shown 

"Human H- 1' ^ ""^ Chart ia ^ ^ pro,™. 

PacZ; I R fr ° m WiSC ° nSin Se — *- lys is 

The cod T ' GenStiCS C ° mPUter Gr °^' Madison, WI. 

The codons which are most frequently used in hiahlv 
35 expressed human genes are presumptively the optimal cod'ons 
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for expression in human host cells, and thus form the basis 
for constructing a synthetic coding sequence. An example of 
a synthetic interferon alpha coding sequence is shown as the 
bottom sequence in the table below. 
5 However, rather than a sequence having fully optimized 

codon usage, it may be desirable to utilize an interferon 
alpha encoding sequence which has optimized codon usage 
except in areas where the same amino acid is too close 
together or abundant to make uniform codon usage optimal. 

10 In addition, other synthetic sequences can be used 

which have substantial portions of the codon usage 
optimized, for example, with at least 50%, 70%, 80% or 90% 
optimized codons as compared to a natural coding sequence. 
Other particular synthetic sequences for interferon alpha 

15 can be selected by reference to the codon usage chart in 
Fig. 3. A sequence is selected by choosing a codon for each 
of the amino acids of the polypeptide sequences. DNA 
molecules corresponding to each of the polypeptides can then 
by constructed . by routine chemical synthesis methods. For 

20 example, shorter oligonucleotides can be synthesized, and 
then ligated in the appropriate relationships to construct 
the full-length coding sequences. 

The following sequences are provided in the sequence 
listing herein: interferon alpha amino acid sequence, SEQ 

25 ID NO: 9; interferon alpha wild type nucleic acid sequence, 
SEQ ID NO: 10; interferon alpha synthetic nucleic acid 
sequence with optimized codon usage, SEQ ID NO: 11; 
interferon alpha nucleic acid sequence with additional/semi- 
optimized codon usage, SEQ ID NO: 12; IL-12 p40 subunit amino 

30 acid sequence, SEQ ID N0:1; IL-12 p40 wild type nucleic acid 
sequence, SEQ ID NO: 2; IL-12 p40 synthetic nucleic acid 
sequence with all codons optimized, SEQ ID NO: 3; IL-12 p40 
subunit nucleic acid sequence with all codons optimized 
except when same nucleic acids were too close/abundant, SEQ 

35 ID NO: 4; IL-12 p35 amino acid sequence, SEQ ID NO: 5; IL-12 
p35 wild type nucleic acid sequence, SEQ ID NO: 6; IL-12 p35 
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25 


30 


SEO in L T SeqU6nCe With 311 COdons °Ptimi 2 eC 

SEQ ' IL " 12 subunit nucleic acid sequence with 

all codons optimized except when same nucleic acids were too 
close/abundant, SEQ id NO: 8. Those skilled in the art will 
realxze that various nucleic acid sequences with optimized 
codon usage can be constructed, for example based on the 
various combinations shown below, wherein optimal usage for 
each codon is shown below the IL-12 p35 and p40 subunit wild 
type sequences and the interferon alpha wild type sequence. 

Sequences Enc oding Human IL-12 p .TS 

First line = natural sequence (SEQ ID-NO. 6) 
Second line = all codons optimized (SEQ ID NO 7) 

nucleic"" -!< ine = COd ° nS ° PtiniiZed — Pt "hen same 

nucleic acids were too close/abundant (changes between 
second and third lines bolded) (SEQ ID NO. 8) 

TTG GCC AGA AAC CTC CCC GTG GCC ACT CCA GAC CCA GGA ATG TTC CCA TGC CTT CAC CAC 
CTG GCC CGC AAC CTG CCC GTG GCC ACC CCC GAC CCC GGC ATG TTC CCC TGC CTG CAC CAC 

TCC CAA AAC CTG CTG AGG GCC GTC AGC AAC ATG CTC CAG AAG GCC AGA CAA ACT CTA GAA 
AGC CAG AAC CTG CTG GCG GCC GTG AGC AAC ATG CTG CAG AAG GCC GCG CAG ACC CTG GAG 
TTT TAC CCT TGC ACT TCT GAA GAG ATT GAT CAT GAA GAT ATC ACA AAA GAT AAA ACC AGC 


ACC GTG GAG GCC TGC CTG CCC CTG GAG CTG ACC AAG AAC GAG AGC TGC CTG AAC AGC CGC 
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5 


GAG 

ACC 

TCT 

TTC 

ATA 

ACT 

AAT 

GGG 

AGT 

TGC 

CTG 

GCC 

TCC 

AGA 

AAG 

ACC 

TCT 

TTT 

ATG 

ATG 

GAG 

ACC 

AGC 

TTC 

ATC 

ACC 

AAC 

GGC 

AGC 

TGC 

CTG 

GCC 

AGC 

CGC 

AAG 

ACC 

AGC 

TTC 

ATG 

ATG 

GAG 

ACC 

TCC 

TTC 

ATC 

ACC 

AAC 

GGC 

ACT 

TGC 

CTG 

GCC 

TCC 

CGC 

AAG 

ACC 

AGC 

TTC 

ATG 

ATG 

GCC 

CTG 

TGC 

CTT 

AGT 

AGT 

ATT 

TAT 

GAA 

GAC 

TTG 

AAG 

ATG 

TAC 

CAG 

GTG 

GAG 

TTC 

AAG 

ACC 

GCC 

CTG 

TGC 

CTG 

AGC 

AGC 

ATC 

TAC 

GAG 

GAC 

CTG 

AAG 

ATG 

TAC 

CAG 

GTG 

GAG 

TTC 

AAG 

ACC 

GCC 

CTG 

TGC 

CTG 

AGC 

TCC 

ATC 

TAC 

GAG 

GAC 

CTG 

AAG 

ATG 

TAC 

CAG 

GTG 

GAG 

TTC 

AAG 

ACC 


ATG 

AAT 

GCA 

AAG 

CTT 

CTG 

ATG 

GAT 

CCT 

AAG 

AGG 

CAG 

ATC 

TTT 

CTA 

GAT 

CAA 

AAC 

ATG 

CTG 

ATG 

AAC 

GCC 

AAG 

CTG 

CTG 

ATG 

GAC 

CCC 

AAG 

CTC 

CAG 

ATC 

TTC 

CTG 

GAC 

CAG 

AAC 

ATG 

CTG 

ATG 

AAC 

GCC 

AAG 

CTC 

CTG 

ATG 

GAC 

CCC 

AAG 

CTC 

CAG 

ATC 

TTC 

QTG 

GAC 

CAG 

AAC 

ATG 

CTG 


GCA 

GTT 

ATT 

GAT 

GAG 

CTG 

ATG 

CAG 

GCC 

CTG 

AAT 

TTC 

AAC 

AGT 

GAG 

ACT 

GTG 

CCA 

CAA 

AAA 

GCC 

GTG 

ATC 

GAC 

GAG 

CTG 

ATG 

CAG 

GCC 

CTG 

AAC 

TTC 

AAC 

AGC 

GAG 

ACC 

GTG 

CCC 

CAG 

AAG 

GCC 

GTG 

ATC 

GAC 

GAG 

CTG 

ATG 

CAG 

GCC 

CTG 

AAC 

TTC 

AAC 

AGC 

GAG 

ACC 

GTG 

CCC 

CAG 

AAG 


TCC 

TCC 

CTT 

GAA 

GAA 

CCG 

GAT 

TTT 

TAT 

AAA 

ACT 

AAA 

ATC 

AAG 

CTC 

TGC 

ATA 

CTT 

CTT 

CAT 

AGC 

AGC 

CTG 

GAG 

GAG 

CCC 

GAC 

TTC 

TAC 

AAG 

ACC 

AAG 

ATC 

AAG 

CTG 

TGC 

ATC 

CTG 

CTG 

CAC 

AGC 

AGC 

CTG 

GAG 

GAG 

CCC 

GAC 

TTC 

TAC 

AAG 

ACC 

AAG 

ATC 

AAG 

CTG 

TGC 

ATC 

CTG 

CTG 

CAC 


GCT 

TTC 

AGA 

ATT 

CGG 

GCA 

GTG 

ACT 

ATT 

GAC 

AGA 

GTG 

ACG 

AGC 

TAT 

CTG 

AAT 

GCT 

TCC 

TAA 

GCC 

TTC 

CGC 

ATC 

CGC 

GCC 

GTG 

ACC 

ATC 

GAC 

CGC 

GTG 

ACC 

AGC 

TAC 

CTG 

AAC 

GCC 

ACC 

TGA 

GCC 

TTC 

CGC 

ATC 

CGG 

GCC 

GTG 

ACC 

ATC 

GAC 

CGC 

GTG 

ACC 

AGC 

TAC 

CTG 

AAC 

GCC 

ACG 

TGA 


25 Additional Optimized Sequences Coding For IL-12 p35 Subunit 
(Second Line = SEQ ID NO: 24) 

10 20 
Met Cys Pro Ala Arg Ser Leu Leu Leu Val Ala Thr Leu Val Leu Leu Asp His Leu Ser 
ATG TGY CCN GCN MGN WSN YTN YTN YTN GTN GCN ACN YTN GTN YTN YTN GAY CAY YTN WSN 

30 

ATG TGT CCT GCT CGT TCT TTA TTA TTA GTT GCT ACT TTA GTT TTA TTA GAT CAT TTA TCT 

TGC CCC GCC CGC TCC TTG TTG TTG GTC GCC ACC TTG GTC TTG TTG GAC CAC TTG TCC 

CCA GCA CGA TCA CTT CTT CTT GTA GCA ACA CTT GTA CTT CTT CTT TCA 

CCG GCG CGG TCG CTC CTC CTC GTG GCG ACG CTC GTG CTC CTC CTC TCG 
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20 


25 


AGA ACT CTA CTA CTA CTA rra rr fi 

*-rA CTA CTA CTA AGT 

AGG AGC CTG CTG CTG CTC CTG rrr 

^ii* CTG CTG CTG AGC 

30 

40 


TTC GCC CGC TTG CCC GTC GCC ACC C CC GAC CCC GGC TTC C CC TGC TTG CAC CAC 

10 CTTGCACGA CTT CCA GTA GCA ACA CCA CCAGGA 

CTC GCG CGG CTC CCG GTG GCG ACG CCG CCG GGG CCG CTC 

CTA AGA CTA 

CTA 

CTG AGG CTG 

CTG 


DU 

WSN CAR AAY YTN YTN MGN CCN GTN WSN AAY ATG YTN CAR AAR GCN MGN CAR ACN YTN GAR 

TCT CAA AAT T TA TTA CGT GCT GTT TCT AAT ATG TTA CAA AAA GOT CGT ^ ACT TTA GAA 
TCC CAG AAC TTG TTG CGC GCC GTC TCC AAC TTG CAG AAG GCC CGC CAG ACC TTG GAG 

CTT CTT CGA GCA GTA TCA CTT GCA CGA ACA CTT 


TCG CTC CTC CGG GCG GTG TCG C TC GCG CGG 

CTA AGA CTA 


ACG CTC 

AGT CTA CTA AGA AGT 


CTG CTG AGG AGC CTG AGG C TG 


70 80 
TTY TAY CCN TGV ACN WSN GAR GAR ATH GAY CAY GAR GAY ATH ACM AAR GAY AAR ACN „SN 
30 TTT TAT OCT TOT ACT TCT GAA GAA ATT GAT CAT GAA GAT ATT ACT AAA IZ AAA ACT TCT 

CCA ACA TCA A TA ATA „- 

ATA ACA ACA TCA 

CCG ACG TCG 

ACG ACG TCG 

AGT 

35 AGT 
JO AGC 

AGC 
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90 100 



Thr 

Val 

Glu 

Ala 

Cys 

Leu 

Pro 

Leu 

Glu 

Leu 

Thr 

Lys 

Asn 

Glu 

Ser 

Cys 

Leu 

Asn 

Ser 

Arg 


ACN 

GTN 

GAR 

GCN 

TGY 

YTN 

CCN 

YTN 

GAR 

YTN 

ACN 

AAR 

AAY 

GAR 

WSN 

TGY 

YTN 

AAY 

WSN 

MGN 

5 

ACT 

GTT 

GAA 

GCT 

TGT 

TTA 

CCT 

TTA 

GAA 

TTA 

ACT 

AAA 

AAT 

GAA 

TCT 

TGT 

TTA 

AAT 

TCT 

CGT 


ACC 

GTC 

GAG 

GCC 

TGC 

TTG 

CCC 

TTG 

GAG 

TTG 

ACC 

AAG 

AAC 

GAG 

TCC 

TGC 

TTG 

AAC 

TCC 

CGC 


ACA 

GTA 


GCA 


CTT 

CCA 

CTT 


CTT 

ACA 




TCA 


CTT 


TCA 

CGA 


ACG 

GTG 


GCG 


CTC 

CCG 

CTC 


CTC 

ACG 




TCG 


CTC 


TCG 

CGG 







CTA 


CTA 


CTA 





AGT 


CTA 


AGT 

AGA 

10 






CTG 


CTG 


CTG 





AGC 


CTG 


AGC 

AGG 











110 










120 


Glu 

Thr 

Ser 

Phe 

He 

Thr 

Asn 

Gly 

Ser 

Cys 

Leu 

Ala 

Ser 

Arg 

Lys 

Thr 

Ser 

Phe 

Met 

Met 


GAR 

ACN 

WSN 

TTY 

ATH 

ACN 

AAY 

GGN 

WSN 

TGY 

YTN 

GCN 

WSN 

MGN 

AAR 

ACN 

WSN 

TTY 

ATG 

ATG 

15 






















GAA 

ACT 

TCT 

TTT 

ATT 

ACT 

AAT 

GGT 

TCT 

TGT 

TTA 

GCT 

TCT 

CGT 

AAA 

ACT 

TCT 

TTT 

ATG 

ATG 


GAG 

ACC 

TCC 

TTC 

ATC 

ACC 

AAC 

GGC 

TCC 

TGC 

TTG 

GCC 

TCC 

CGC 

AAG 

ACC 

TCC 

TTC 





ACA 

TCA 


ATA 

ACA 


GGA 

TCA 


CTT 

GCA 

TCA 

CGA 


ACA 

TCA 






ACG 

TCG 



ACG 


GGG 

TCG 


CTC 

GCG 

TCG 

CGG 


ACG 

TCG 




20 



AGT 






AGT 


CTA 


AGT 

AGA 



AGT 







AGC 






AGC 


CTG 


AGC 

AGG 



AGC 














130 










140 


Ala 

Leu 

Cys 

Leu 

Ser 

Ser 

He 

Tyr 

Glu 

Asp 

Leu 

Lys 

Met 

Tyr 

Gin 

Val 

Glu 

Phe 

Lys 

Thr 

25 

GCN 

YTN 

TGY 

YTN 

WSN 

WSN 

ATH 

TAY 

GAR 

GAY 

YTN 

AAR 

ATG 

TAY 

CAR 

GTN 

GAR 

TTY 

AAR 

ACN 


GCT 

TTA 

TGT 

TTA 

TCT 

TCT 

ATT 

TAT 

GAA 

GAT 

TTA 

AAA 

ATG 

TAT 

CAA 

GTT 

GAA 

TTT 

AAA 

ACT 


GCC 

TTG 

TGC 

TTG 

TCC 

TCC 

ATC 

TAC 

GAG 

GAC 

TTG 

AAG 


TAC 

CAG 

GTC 

GAG 

TTC 

AAG 

ACC 


GCA 

CTT 


CTT 

TCA 

TCA 

ATA 




CTT 





GTA 




ACA 

30 

GCG 

CTC 


CTC 

TCG 

TCG 





CTC 





GTG 




ACG 



CTA 


CTA 

AGT 

AGT 





CTA 












CTG 


CTG 

AGC 

AGC 





CTG 











BNSDOCID <WO 994767BA2 l > 


WO 99/47678 


PCT/US99/05394 


42 


150 

160 

Arcj Gin T1 ^ pha t^... * „ , 

Leu 


H.« *.» AXa Lys Leu Leu Met Asp pro ^ ftrg ne ^ ftsp G ^ ^ 

ATG AAY GC N AAR m M ATG GflY CCN ^ „ GN CftR ftTH TTy YTN GAY CAR AAY ATG YTN 


5 


7 ^ ^ ™ ^ « ^ - ~ ATT « TTA . T CAA W ATG TTA 

^ ™ GAC CCC AAG CGC CAG ATC TTC TTG GAC CAG AAC TTG 

CTT CTT CCA CGA ATA CTT „ 

GCG CTC CTC CCG CGG CTC 

CTC CTC 


CTA CTA 


AGA cta 

10 CTA 
CTG CTG AGG 

""^ CTG 


CTG 


170 

180 


i5 « « A. CAY CAR « ATG CAR GC„ YT N AAY TTY AAY WSN GAR ACN GTN Z « ^ 

OCT GTT ATT GAT GAA TTA ATG CAA GCT TTA AAT TTT AAT Z GAA ^ GTT CCT Z Z 
GTC ~ « ™ CAG GCC TTG AAC TTC AAC TCC GAG ACC GTC CCC CAG 2 


20 


GCA GTA ATA CTT GCACTT tca 

GCG CTC 


GCG GTG CTC 
CTA 


CTA 
CTG 


ACA GTA CCA 
TCG ACG GTG CCG 

AGT 
AGC 


25 


CAY 


TCT TCT TTA GAA GAA CCT GAT TTT TAT AAA ACT AAA ATT AAA TTA TGT ATT TTA TTA CAT 

tca ::: gag gag ::; gac ttc tac ~ * cc - - «■ « ~ - « « - 

30 TCG TCG CTC CCG ^ ^ ^ ATA CTT CTT 

AGT AGT CTA ^ ^ 

CTA CTA CTfl 

AGC AGC CTG 

CTG CTG CTG 


8NSDOCID <WO _ 994767BA2 I > 


WO 99/47678 


PCT/US99/05394 


43 


210 220 
Ala Phe Arg lie Arg Ala Val Thr lie Asp Arg Val Thr Ser Tyr Leu Asn Ala Ser *** 
GCN TTY MGN ATH MGN GCN GTN ACN ATH GAY MGN GTN ACN WSN TAY YTN AAY GCN WSN TRR 

5 GCT TTT CGT ATT CGT GCT GTT ACT ATT GAT CGT GTT ACT TCT TAT TTA AAT GCT TCT TAA 
GCC TTC CGC ATC CGC GCC GTC ACC ATC GAC CGC GTC ACC TCC TAC TTG AAC GCC TCC TAG 
GCA CGA ATA CGA GCA GTA ACA ATA CGA GTA ACA TCA CTT GCA TCA TGA 

GCG CGG CGG GCG GTG ACG CGG GTG ACG TCG CTC GCG TCG 

AGA AGA AGA AGT CTA AGT 

1 0 AGG AGG AGG AGC CTG AGC 

Sequences Encoding Human IL-12 p4Q 

First line = natural sequence (SEQ ID NO - 2) 
Second line = all codons optimized (SEQ ID NO. 3) 
Third line = all codons optimized except when same 
15 nucleic acids were too close/abundant (changes between 

second and third lines bolded) (SEQ ID NO. 4) 

ATG TGT CAC CAG CAG TTG GTC ATC TCT TGG TTT TCC CTG GTT TTT CTG GCA TCT CCC CTC 
ATG TGC CAC CAG CAG CTG GTG ATC AGC TGG TTC AGC CTG GTG TTC CTG GCC AGC CCC CTG 
ATG TGC CAC CAG CAG CTG GTG ATC AGC TGG TTC TCC CTG GTG TTT CTG GCC AGC CCC CTC 

20 


25 


30 


35 


GTG 

GCC 

ATA 

TGG 

GAA 

CTG 

AAG 

AAA 

GAT 

GTT 

TAT 

GTC 

GTA 

GAA 

TTG 

GAT 

TGG 

TAT 

CCG 

GAT 

GTG 

GCC 

ATC 

TGG 

GAG 

CTG 

AAG 

AAG 

GAC 

GTG 

TAC 

GTG 

GTG 

GAG 

CTG 

GAC 

TGG 

TAC 

CCC 

GAC 

GTG 

GCC 

ATC 

TGG 

GAG 

CTG 

AAG 

AAA 

GAC 

GTG 

TAC 

GTG 

GTC 

GAG 

CTG 

GAC 

TGG 

TAC 

CCC 

GAC 

GCC 

CCT 

GGA 

GAA 

ATG 

GTG 

GTC 

CTC 

ACC 

TGT 

GAC 

ACC 

CCT 

GAA 

GAA 

GAT 

GGT 

ATC 

ACC 

TGG 

GCC 

CCC 

GGC 

GAG 

ATG 

GTG 

GTG 

CTG 

ACC 

TGC 

GAC 

ACC 

CCC 

GAG 

GAG 

GAC 

GGC 

ATC 

ACC 

TGG 

GCC 

CCC 

GGC 

GAG 

ATG 

GTG 

GTC 

CTG 

ACC 

TGC 

GAC 

ACC 

CCC 

GAG 

GAA 

GAC 

GGC 

ATC 

ACC 

TGG 

ACC 

TTG 

GAC 

CAG 

AGC 

AGT 

GAG 

GTC 

TTA 

GGC 

TCT 

GGC 

AAA 

ACC 

CTG 

ACC 

ATC 

CAA 

GTC 

AAA 

ACC 

CTG 

GAC 

CAG 

AGC 

AGC 

GAG 

GTG 

CTG 

GGC 

AGC 

GGC 

AAG 

ACC 

CTG 

ACC 

ATC 

CAG 

GTG 

AAG 

ACC 

CTG 

GAC 

CAG 

AGC 

AGT 

GAG 

GTG 

CTG 

GGC 

TCC 

GGC 

AAG 

ACC 

CTG 

ACC 

ATC 

CAG 

GTG 

AAG 

GAG 

TTT 

GGA 

GAT 

GCT 

GGC 

CAG 

TAC 

ACC 

TGT 

CAC 

AAA 

GGA 

GGC 

GAG 

GTT 

CTA 

AGC 

CAT 

TCG 

GAG 

TTC 

GGC 

GAC 

GCC 

GGC 

CAG 

TAC 

ACC 

TGC 

CAC 

AAG 

GGC 

GGC 

GAG 

GTG 

CTG 

AGC 

CAC 

AGC 

GAG 

TTC 

GGC 

GAC 

GCC 

GGC 

CAG 

TAC 

ACC 

TGC 

CAC 

AAG 

GGA 

GGC 

GAG 

GTG 

CTG 

AGC 

CAC 

TCC 


BNSOOCID: <WO 9947678A2 I > 


WO 99/47678 


PCT/US99/0S394 


44 


CTC CTG CTG CTT CAC AAA AAG GAA GAT GGA ATT TGG TCC ACT GAT ATT TTA AAG GAC CAG 
CTG CTG CTG CTG CAC AAG AAG GAG GAC GGC ATC TGG AGC ACC GAC ATC CTG AAG GAC CAG 
CTC CTG CTG CTC CAC AAA AAG GAG GAC GGC ATC TGG AGC ACC GAC ATC CTG AAG GAC CAG 

AAA GAA CCC AAA AAT AAG ACC TTT CTA AGA TGC GAG GCC AAG AAT TAT TCT GGA CGT TTC 
AAG GAG CCC AAG AAC AAG ACC TTC CTG CGC TGC GAG GCC AAG AAC TAC AGC GGC CGC TTC 
AAG GAG CCC AAG AAC AAG ACC TTC CTG CGC TGC GAG GCC AAG AAC TAC AGC GGC CGC TTC 

ACC TGC TGG TGG CTG ACG ACA ATC AGT ACT GAT TTG ACA TTC ACT GTC AAA AGC AGC AGA 
ACC TGC TGG TGG CTG ACC ACC ATC AGC ACC GAC CTG ACC TTC AGC GTG AAG AGC AGC AGG 
ACC TGC TGG TGG CTG ACC ACG ATC AGC ACC GAC CTG ACC TTC AGT GTG AAG AGC AGC AGG 

GGC TCT TCT GAC CCC CAA GGG GTG ACG TGC GGA GCT GCT ACA CTC TCT GCA GAG AGA GTC 
15 GGC AGC AGC GAC CCC CAG GGC GTG ACC TGC GGC CCC GCC ACC CTG AGC GCC GAG CGC GTG 
GGC TCC AGC GAC CCC CAG GGC GTG ACC TGC GGC GCT GCC ACC CTG AGC GCC GAG CGC GTG 


10 


20 


25 


30 


AGA GGG GAC AAC AAG GAG TAT GAG TAC TCA GTG GAG TGC CAG GAG GAC AGT GCC TGC CCA 
CGC GGC GAC AAC AAG GAG TAC GAG TAC AGC GTG GAG TGC CAG GAG GAC AGC GCC TGC CCC 
CGC GGC GAC AAC AAG GAG TAC GAG TAC AGC GTG GAG TGC CAG GAA GAC TCC GCC TGC CCC 

GCT GCT GAG GAG AGT CTG CCC ATT GAG GTC ATG GTG GAT GCC GTT CAC AAG CTC AAG TAT 
GCC GCC GAG GAG AGC CTG CCC ATC GAG GTG ATG GTG GAC GCC GTC CAC AAG CTG AAG TAC 
GCC GCT GAG GAG AGC CTG CCC ATC GAG GTG ATG GTG GAC GCC GTT CAC AAG CTG AAG TAC 

GAA AAC TAC ACC AGC AGC TTC TTC ATC AGG GAC ATC ATC AAA CCT GAC CCA CCC AAG AAC 
GAG AAC TAC ACC AGC AGC TTC TTC ATC CGC GAC ATC ATC AAG CCC GAC CCC CCC AAG AAC 
GAG AAC TAC ACC AGC AGC TTC TTC ATC CGC GAC ATC ATC AAG CCT GAC CCA CCC AAG AAC 


TTG CAG CTG AAG CCA TTA AAG AAT TCT CGG CAG GTG GAG GTC AGC TGG GAG TAC CCT GAC 
CTG CAG CTG AAG CCC CTG AAG AAC AGC CGC CAG GTG GAG GTG AGC TGG GAG TAC CCC GAC 
CTC CAG CTG AAG CCC CTC AAG AAC TCC CGC CAG GTG GAG GTG AGC TGG GAG TAC CCC GAC 
ACC TGG AGT ACT CCA CAT TCC TAC TTC TCC CTG ACA TTC TGC GTT CAG GTC CAG GGC AAG 
ACC TGG AGC ACC CCC CAC AGC TAC TTC AGC CTG ACC TTC TGC GTG CAG GTG CAG GGC AAG 
35 ACC TGG AGC ACG CCC CAC TCC TAC TTC TCC CTG ACC TTC TGC GTG CAG GTC CAG GGC AAG 


BNSOOCID: <WO 994767BA2 I > 


WO 99/47678 


PCT/US99/0S394 


45 


AGC 

AAG 

AG A 

GAA 

AAG 

AAA 

GAT 

AGA 

GTC 

TTC 

ACG 

GAC 

AAG 

ACC 

TCA 

GCC 

ACG 

GTC 

ATC 

TGC 

AGC 

AAG 

CGC 

GAG 

AAG 

AAG 

GAC 

CGC 

GTG 

TTC 

ACC 

GAC 

AAG 

ACC 

AGC 

GCC 

ACC 

GTG 

ATC 

TGC 

AGC 

AAG 

CGC 

GAG 

AAG 

AAA 

GAC 

CGG 

GTG 

TTC 

ACC 

GAC 

AAG 

ACC 

AGC 

GCC 

ACC 

GTC 

ATC 

TGC 

CGC 

AAA 

AAT 

GCC 

AGC 

ATT 

AGC 

GTG 

CGG 

GCC 

CAG 

GAC 

CGC 

TAC 

TAT 

AGC 

TCA 

TCT 

TGG AGC 

CGC 

AAG 

AAC 

GCC 

AGC 

ATC 

AGC 

GTG 

CGC 

GCC 

CAG 

GAC 

CGC 

TAC 

TAC 

AGC 

AGC 

AGC 

TGG 

AGC 

CGC 

AAG 

AAC 

GCC 

AGC 

ATC 

AGC 

GTG 

CGC 

GCC 

CAG 

GAC 

CGC 

TAC 

TAT 

AGC 

TCC 

TCT 

TGG 

AGC 

GAA 

TGG 

GCA 

TCT 

GTG 

CCC 

TGC 

AGT 

TAG 












GAG 

TGG 

GCC 

AGC 

GTG 

CCC 

TGC 

AGC 

TAG 












GAG 

TGG 

GCC 

AGC 

GTG 

CCC 

TGC 

TCC 

TAG 













Additional Optimized Sequences Coding For IL-12 p4 0 Subunit 
(Second Line SEQ ID NO-.25) 












10 










20 

15 

Met 

Cys 

His 

Gin 

Gin 

Leu 

Val 

He 

Ser 

Trp 

Phe 

Ser 

Leu 

Val 

Phe 

Leu 

Ala 

Ser 

Pro 

Leu 


ATG 

TGY 

CAY 

CAR 

CAR 

YTN 

GTN 

ATH 

WSN 

TGG 

TTY 

WSN 

YTN 

GTN 

TTY 

YTN 

GCN 

WSN 

CCN 

YTN 


ATG" TGT 

CAT 

CAA 

CAA 

TTA 

GTT 

ATT 

TCT 

TGG 

TTT 

TCT 

TTA 

GTT 

TTT 

TTA 

GCT 

TCT 

CCT 

TTA 



TGC 

CAC 

CAG 

CAG 

TTG 

GTC 

ATC 

TCC 


TTC 

TCC 

TTG 

GTC 

TTC 

TTG 

GCC 

TCC 

CCC 

TTG 

20 






CTT 

GTA 

ATA 

TCA 



TCA 

CTT 

GTA 


CTT 

GCA 

TCA 

CCA 

CTT 







CTC 

GTG 


TCG 



TCG 

CTC 

GTG 


CTC 

GCG 

TCG 

CCG 

CTC 







CTA 



AGT 



AGT 

CTA 



CTA 


AGT 


CTA 







CTG 



AGC 



AGC 

CTG 



CTG 


AGC 


CTG 

25 










30 










40 


Val 

Ala 

He 

Trp 

Glu 

Leu 

Lys 

Lys 

Asp 

Val 

Tyr 

val 

Val 

Glu 

Leu 

Asp 

Trp 

Tyr 

Pro Asp 


GTN 

GCN 

ATH 

TGG 

GAR 

YTN 

AAR 

AAR 

GAY 

GTN 

TAY 

GTN 

GTN 

GAR 

YTN 

GAY 

TGG 

TAY 

CCN 

GAY 


GTT 

GCT 

ATT 

TGG 

GAA 

TTA 

AAA 

AAA 

GAT 

GTT 

TAT 

GTT 

GTT 

GAA 

TTA 

GAT 

TGG 

TAT 

CCT 

GAT 

30 

GTC 

GCC 

ATC 


GAG 

TTG 

AAG 

AAG 

GAC 

GTC 

TAC 

GTC 

GTC 

GAG 

TTG 

GAC 


TAC 

CCC 

GAC 


GTA 

GCA 

ATA 



CTT 




GTA 


GTA 

GTA 


CTT 




CCA 



GTG 

GCG 




CTC 




GTG 


GTG 

GTG 


CTC 




CCG 



CTA CTA 
CTG CTG 

35 


BNSOOGID <WO 9947678A2 I > 


WO 99/47678 


PCT/US99/05394 


46 


50 

60 


10 


« « ggn „ ^ GTN „ „ _ TGy w acn cc(j GflR _ J J m - - -p 

GCT CCT GGT GAA ATG GTT GTT TTA ACT TPT ran- Rm 

ACT TGT ACT CCT GAA GAA GAT GGT ATT ACT TOT 

2 2 2 GAG GTC GTC TTG ACC TCC ~ Acc ccc « ~ « « - « 

CCG GTG GTG CTC ACG ACG CCC 

CCG GGG ACG 

CTA 


w «. ytn „ Y car WSN „ „ ^ ggn wsn _ _ _ _ J «■ - £ 

ACT TTA GAT CAA TCT TCT CAA GTT TTA GGT TCT GGT AAA ACT TTA ACT ATT ^ ^ ^ 

:: :;; gac cac :: ;: - z - - « - ~ « « « « C a G g 5 ;; r 

ACG CTC GTA CTT GGA TCA GGA ACA CTT ACA ATA GTA 

TCG TCG GTG CTC GGG TCG GGG ACG CTC ACG 

CTA AGT AGT CTA AGT CTA 


CTG AGC AGC CTG AGC 


CTG 


100 


«» Phe Gly Aa p „. Gly Gln Tyr Thr cys His ^ 
25 „ „ „ „ GCN GGN CAR » Y ACN TGY CA, AAR GGN GGN GAR GTN YTN WSN ^ ^ 

GAA TTT GGT GAT GCT GGT CAA TAT ACT TGT CAT AAA GGT GGT C*a Zl 

/4fta GGT GAA GTT TTA TCT CAT tct 

« « GGC GAC GCC GCC CAG TAC ACC TCC CAC AAG GGC GGC GAG GTC TTG TCC Z TCC 

GGA GCA GGA ACA 

30 _ GTA CTT TCA TCA 

JU GGG GCG GGG A r r 

G GGG GGG GTG CTC TCG TCG 

CTA AGT AGT 

CTG AGC AGC 


BNSOOC1D <WO_994767BA2 I > 


WO 99/47678 


PCT/US99/05394 


47 

110 120 



Leu 

Leu 

Leu 

Leu 

His 

Lys 

Lys 

Glu 

Asp Gly 

lie 

Trp 

Ser 

Thr 

Asp 

He 

Leu 

Lys 

Asp 

Gin 


YTN 

YTN 

YTN 

YTN 

CAY 

AAR 

AAR 

GAR 

GAY 

GGN 

ATH 

TGG 

WSN 

ACN 

GAY 

ATH 

YTN 

AAR 

GAY 

CAR 

5 

TTA 

TTA 

TTA 

TTA 

CAT 

AAA 

AAA 

GAA 

GAT 

GGT 

ATT 

TGG 

TCT 

ACT 

GAT 

ATT 

TTA 

AAA 

GAT 

CAA 


TTG 

TTG 

TTG 

TTG 

CAC 

AAG 

AAG 

GAG 

GAC 

GGC 

ATC 


TCC 

ACC 

GAC 

ATC 

TTG 

AAG 

GAC 

CAG 


CTT 

CTT 

CTT 

CTT 






GGA 

ATA 


TCA 

ACA 


ATA 

CTT 





CTC 

CTC 

CTC 

CTC 






GGG 



TCG 

ACG 



CTC 





CTA 

CTA 

CTA 

CTA 









AGT 




CTA 




10 

CTG 

CTG 

CTG 

CTG 









AGC 




CTG 














130 










140 


Lys 

Glu 

Pro 

Lys 

Asn 

Lys 

Thr 

Phe 

Leu 

Arg 

Cys 

Glu 

Ala 

Lys 

Asn 

Tyr 

Ser 

Gly 

Arg 

Phe 








ACN 

TTY 

YTN 

MGN 

TGY 

GAR 

GCN 

AAR 

AAY 

TAY 

WSN 

GGN 

MGN 

TTY 

15 






















AAA 

GAA 

CCT 

AAA 

AAT 

AAA 

ACT 

TTT 

TTA 

CGT 

TGT 

GAA 

GCT 

AAA 

AAT 

TAT 

TCT 

GGT 

CGT 

TTT 


AAG 

GAG 

CCC 

AAG 

AAC 

AAG 

ACC 

TTC 

TTG 

CGC 

TGC 

GAG 

GCC 

AAG 

AAC 

TAC 

TCC 

GGC 

CGC 

TTC 




CCA 




ACA 


CTT 

CGA 



GCA 




TCA 

GGA 

CGA 





CCG 




ACG 


CTC 

CGG 



GCG 




TCG 

GGG 

CGG 


20 









CTA 

AGA 







AGT 


AGA 











CTG 

AGG 







AGC 


AGG 












150 










160 


Thr 

Cys 

Trp 

Trp 

Leu 

Thr 

Thr 

lie 

Ser 

Thr 

Asp 

Leu 

Thr 

Phe 

Ser 

Val 

Lys 

Ser 

Ser 

Arg 

25 

ACN 

TGY 

TGG 

TGG 

YTN 

ACN 

ACN 

ATH 

WSN 

ACN 

GAY 

YTN 

ACN 

TTY 

WSN 

GTN 

AAR 

WSN 

WSN 

MGN 


ACT 

TGT 

TGG 

TGG 

TTA 

ACT 

ACT 

ATT 

TCT 

ACT 

GAT 

TTA 

ACT 

TTT 

TCT 

GTT 

AAA 

TCT 

TCT 

CGT 


ACC 

TGC 



TTG 

ACC 

ACC 

ATC 

TCC 

ACC 

GAC 

TTG 

ACC 

TTC 

TCC 

GTC 

AAG 

TCC 

TCC 

CGC 


ACA 




CTT 

ACA 

ACA 

ATA 

TCA 

ACA 


CTT 

ACA 


TCA 

GTA 


TCA 

TCA 

CGA 

30 

ACG 




CTC 

ACG 

ACG 


TCG 

ACG 


CTC 

ACG 


TCG 

GTG 


TCG 

TCG 

CGG 






CTA 




AGT 



CTA 



AGT 



AGT 

AGT 

AGA 






CTG 




AGC 



CTG 



AGC 



AGC 

AGC 

AGG 


BNSDOCIO <WO S94767BA2 I > 


WO 99/47678 


PCT/US99/0S394 


48 


CGT GTT 


200 


170 

Gly Ser ser A3 P Pro G1 „ Gly Val Thr Cys G "0 

- « - « „ « „ „ „ r „ ™ :;; r 

5 OCT TCT TCT G ftT CCT CAA GGT CTT ACT TGT GGT Z Z Z Z TCT GCT GAA 

« z z ** ::: cag ggc ctc acc tgc ggc gcc « * cc « ~ « - « CTC 

GGA TCA TCA GGA GTA ACA GGA GCA GCA ACA CTT TCA GCA CGA GTA 

GGG TCG TCG COG GGG GTG ACG GGG GCG GCG ACG CTC TCG GCG CGG GTG 

AGT AGT GTG 

1 fl ^A AGT AGA 

10 AGC AGC GA 

CTG AGC AGG 

190 

Arg Gly Asp Asn Lys Glu Tyr 

is - - - - - „ zzz 2 :: z °z z; 

CGT GGT GAT AAT AAA GAA TAT GAA TAT TCT GTT GAA TGT CAA r*n n» 

^„ _ CAA GAA GAT TCT GCT TGT CCT 

« «c GAC AAC AAG GAG T AC GAG TAC TC C G TC GAG TGC CAG GAG GAC T CC GCC TGC Z 

CGG GGG TCA GTA ^ 

20 ftG A TCG GTG TCG GCG CCG 

AGG 

AGC ' AGC 

210 

25 2 2 Glu Glu ser Leu Pro lls Glu vai Met vai *- «• - - - ™ 

«. « GAK GAK „S„ ™ CCN ATH GAK GTN ATG GTN GAY GCN GTN CAY AAR YTN AAR Jy 

OCT GCT GAA GAA TCT TTA CCT ATT GAA GTT ATG GTT GAT GCT Z Z AAA Z Z Z 
GCC GAG GAG TCC TTG CCC ATC GAG GTC GTC r»r r,, 

^ G GTC CAC AAG TTG AAG TAC 

GCA GCA TCA CTT CCA ATA GTA GTA GCA GTA 

30 GCG GCG TCG CTr rrr 

TCG CTC CCG GTG GTG GCG GTG C TC 

AGT CTA 

CTA 

AGC CTG 

CTG 


BNSDOCID cWO. 994767BA2 I > 


WO 99/47678 


PCT/US99/0S394 


49 


230 240 
Glu Asn Tyr Thr Ser Ser Phe Phe He Arg Asp He He Lys Pro Asp Pro Pro Lys Asn 
- GAR AAY TAY ACN WSN WSN TTY TTY ATH MGN GAY ATH ATH AAR CCN GAY CCN CCN AAR AAY 

5 GAA AAT TAT ACT TCT TCT TTT TTT ATT CGT GAT ATT ATT AAA CCT GAT CCT CCT AAA AAT 

GAG AAC TAC ACC TCC TCC TTC TTC ATC CGC GAC ATC ATC AAG CCC GAC CCC CCC AAG AAC 

ACA TCA TCA ATA CGA ATA ATA CCA CCA CCA 

ACG TCG TCG CGG CCG CCG CCG 

AGT AGT AGA 
10 AGC AGC AGG 

250 260 
Leu Gin Leu Lys Pro Leu Lys Asn Ser Arg Gin Val Glu Val Ser Trp Glu Tyr Pro Asp 
YTN CAR YTN AAR CCN YTN AAR AAY WSN MGN CAR GTN GAR GTN WSN TGG GAR TAY CCN GAY 


15 


TTA CAA TTA AAA CCT TTA AAA AAT TCT CGT CAA GTT GAA GTT TCT TGG GAA TAT CCT GAT 
TTG CAG TTG AAG CCC TTG AAG AAC TCC CGC CAG GTC GAG GTC TCC GAG TAC CCC GAC 

CTT CTT CCA CTT TCA CGA GTA GTA TCA CCA 

CTC CTC CCG CTC TCG CGG GTG GTG TCG CCG 

20 CTA CTA CTA AGT AGA AGT 

CTG CTG CTG AGC AGG AGC 

270 280 
Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr Phe Cys Val Gin Val Gin Gly Lys 
25 ACN TGG WSN ACN CCN CAY WSN TAY TTY WSN YTN ACN TTY TGY GTN CAR GTN CAR GGN AAR 

ACT TGG TCT ACT CCT CAT TCT TAT TTT TCT TTA ACT TTT TGT GTT CAA GTT CAA GGT AAA 

ACC TCC ACC CCC CAC TCC TAC TTC TCC TTG ACC TTC TGC GTC CAG GTC CAG GGC AAG 

ACA TCA ACA CCA TCA TCA CTT ACA GTA GTA GGA 

30 ACG TCG ACG CCG TCG TCG CTC ACG GTG GTG GGG 

AGT AGT AGT CTA 

AGC AGC AGC CTG 
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290 

~ = :: r r :: r z z z r r - - - » - ::: 

_•- ACN W *** »SN GCN ACN GTN ATH TGY 

5 TCT AAA CGT GAA AAA AAA GAT CGT GTT TTT ACT GAT a» IZ 

™ AAG CCC GAG AAG AAG GAC CGC GTC TTC 2 Z 2 Z ^ " ^ "* 
TCA CGA ACC TCC GCC ACC GTC ATC TGC 

TCG CGG CGAGTA ACA TCA GCA ACA GTA ATA 

AGT ACA CGGGTG ACG TCG GCG ACG GTG 

10 AGC - 

AGC 

310 

Arg Lys Asn Ala Ser lie Ser Val Arg Ala Gin a. * 320 

y Ala Gin Asp Arg Tyr Tvr q^r- c^*- o 

ia --.»-. „ „, ^ try :;; :;; :; :;; 


CGC AAG AAC GCC TCC ATC TCC GTC CGC 
CGA gca TCA ATA TCA GTA CGA GCA 

CGG GCG TCG 


TCG GTG CGG GCG 
20 AGA ftGT AGA 

AGG AGC AGC AGG 


GAT 

CGT 

TAT 

TAT 

TCT 

TCT 

TCT 

TGG TCT 

GAC 

CGC 

TAC 

TAC 

TCC 

TCC 

TCC 

TCC 


CGA 



TCA 

TCA 

TCA 

TCA 


CGG 



TCG 

TCG 

TCG 

TCG 


AGA 



AGT 

AGT 

AGT 

AGT 


AGG 



AGC 

AGC 

AGC 

AGC 


25 


30 


Glu Trp Ala Ser Val Pro c ys Ser *** 
GAR TGG GCN WSN GTN CCN TGY WSN TRR 

GAA TGG GCT TCT GTT CCT TGT TCT TAA 
GAG GCC TCC GTC CCC TGC TCC TAG 

GCA TCA GTA CCA TCA TGA 

GCG TCG GTG CCG TCG 


AGT 
AGC 


AGT 
AGC 
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Wild Type Sequence Coding For Interferon Alpha 

9 18 27 36 45 54 

5' ATG GCC TTG ACC TTT GCT TTA CTG GTG GCC CTC CTG GTG CTC AGC TGC AAG TCA 

5 MALTFALLVALLVLSCKS 

63 72 81 90 99 108 

AGC TGC TCT GTG GGC TGT GAT CTG CCT CAA ACC CAC AGC CTG GGT AGC AGG AGG 

10 SCSVGCDLPQTHSLGSRR 

117 126 135 144 153 162 

ACC TTG ATG CTC CTG GCA CAG ATG AGG AGA ATC TCT CTT TTC TCC TGC TTG AAG 

15 T LM L LAQMRR I S L F S ,C L.K 

171 180 189 198 207 216 

GAC AGA CAT GAC TTT GGA TTT CCC CAG GAG GAG TTT GGC AAC CAG TTC CAA AAG 

20 DRHD FGFPQEE FGNQFQK 

225 234 243 252 261 270 

GCT GAA ACC ATC CCT GTC CTC CAT GAG ATG ATC CAG CAG ATC TTC AAT CTC TTC 

25 AETI PVLHEMIQQI FNLF 

279 288 297 306 315 324 

AGC ACA AAG GAC TCA TCT GCT GCT TGG GAT GAG ACC CTC CTA GAC AAA TTC TAC 

30 STKDSSAAWDETLLDKFY 

333 342 351 360 369 378 

ACT GAA CTC TAC CAG CAG CTG AAT GAC CTG GAA GCC TGT GTG ATA CAG GGG GTG 

35 TELYQQLNDLEACVIQGV 
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396 


GGG CTG ACA GAG ACT CCC CTC ATG AAG GAG GAC TCC ATT CTG GCT GTG AGG AAA 


GVTET PLMKED 


S I L A V 


TAC TTC CAA AGA ATC ACT CTC TAT CTO AAA GAG AAG AAA TAC AGC CCT TGT CCC 


10 


Y F ° R 1 T f L K E K K 


Y S P C A 


15 


TOO GAG GTT GTC AGA GCA GAA ATC ATG AGA TCT TTT TCT TTG TCA ACA AAC TTG 

SFSLSTNL 


W EV VRAEIMR 


549 558 567 

CAA GAA AGT TTA AGA AGT AAG GAA TGA 3 ■ 

QESLRSKE* 

ATG GCC CTG ACC TTC CCC CTG CTG GTG GCC CTG CTG GTG CTG AGC TGC AAG AGC AGC TGC 
25 TCC GTG GGG TGC GAC CTG CCC CAG ACC CAC AGC CTG GGG AGC CGG CGG ACC CTG ATG CTG 

^ TTC CCC CAG GAG GAG TTC GGG AAC CAG TTC CAG AAG GCC GAG ACC ATC CCC GTG CTG CAC 
35 TGC GTG ATC CAG GGG GTG GGG GTG ACC GAG ACC CCC CTG ATG AAG GAG GAC AGC ATC CTG 
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GCC GTG CGG AAG TAC TTC CAG CGG ATC ACC CTG TAC CTG AAG GAG AAG AAG TAC TCC CCC 
TGC GCC TGG GAG GTG GTG CGG GCC GAG ATC ATG CGG AGC TTC AGC CTG AGC ACC AAC CTG 

5 

CAG GAG AGC CTG CGG AGC AAG GAG TGA 

Additional /Semi Optimized Sequence Coding For Interferon 
Alpha (Second Line - SEQ ID NO: 12) 

10 10 20 


20 


MET 

ALA 

LEU 

THR 

PHE 

ALA 

LEU 

LEU 

VAL 

ALA 

LEU 

LEU 

VAL 

LEU 

SER 

CYS 

LYS 

SER 

SER 

CYS 

ATG 

GCN 

YTN 

ACN 

TTY 

GCN 

YTN 

YTN 

GTN 

GCN 

YTN 

YTN 

GTN 

YTN 

WSN 

TGY 

AAR 

WSN 

WSN 

TGY 

ATG 

GCT 

TTA 

ACT 

TTT 

GCT 

TTA 

TTA 

GTT 

GCT 

TTA 

TTA 

GTT 

TTA 

TCT 

TGT 

AAA 

TCT 

TCT 

TGT 


GCC 

TTG 

ACC 

TTC 

GCC 

TTG 

TTG 

GTC 

GCC 

TTG 

TTG 

GTC 

TTG 

TCC 

TGC 

AAG 

TCC 

TCC 

TGC 


GCA 

CTT 

ACA 


GCA 

CTT 

CTT 

GTA 

GCA 

CTT 

CTT 

GTA 

CTT 

TCA 



TCA 

TCA 



GCG 

CTC 

ACG 


GCG 

CTC 

CTC 

GTG 

GCG 

CTC 

CTC 

GTG 

CTC 

TCG 



TCG 

TCG 




CTA 




CTA 

CTA 



CTA 

CTA 


CTA 

AGT 



AGT 

AGT 




CTG 




CTG 

CTG 



CTG 

CTG 


CTG 

AGC 



AGC 

AGC 











30 










40 

SER 

VAL 

GLY 

CYS 

ASP 

LEU 

PRO 

GLN 

THR 

HIS 

SER 

LEU 

GLY 

SER 

ARG 

ARG 

THR 

LEU 

MET 

LEU 

WSN 

GTN 

GGN 

TGY 

GAY 

YTN 

CCN 

CAR 

ACN 

CAY 

WSN 

YTN 

GGN 

WSN 

MGN 

MGN 

ACN 

YTN 

ATG 

YTN 

TCT 

GTT 

GGT 

TGT 

GAT 

TTA 

CCT 

CAA 

ACT 

CAT 

TCT 

TTA 

GGT 

TCT 

CGT 

CGT 

ACT 

TTA 

ATG 

TTA 

TCC 

GTC 

GGC 

TGC 

GAC 

TTG 

CCC 

CAG 

ACC 

CAC 

TCC 

TTG 

GGC 

TCC 

CGC 

CGC 

ACC 

TTG 


TTG 

TCA 

GTA 

GGA 



CTT 

CCA 


ACA 


TCA 

CTT 

GGA 

TCA 

CGA 

CGA 

ACA 

CTT 


CTT 

TCG 

GTG 

GGG 



CTC 

CCG 


ACG 


TCG 

CTC 

GGG 

TCG 

CGG 

CGG 

ACG 

CTC 


CTC 

AGT 





CTA 





AGT 

CTA 


AGT 

AGA 

AGA 


CTA 


CTA 


30 AGC CTG AGC CTG AGC AGG AGG CTG CTG 
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10 


50 

60 

YTH GCN CAR ATG MGN MGN ATH WSN YTN TTY WSN TGY YTN AAR GAY MGN CAY GAY TTY GGN 

TTA GCT CAA ATG CGT CGT ATT TCT TTA TTT TCT TGT TTA AAA GAT CGT CAT GAT TTT GGT 
TTG GCC CAG CGC CGC ATC TCC TTG TTC TCC TGC TTG AAG GAC CGC CAC GAC TTC GGC 

CTT GCA CGA CGA ATA TCA CTT TCA CTT rrA 

CGA GGA 
CTC GCG CGG CGG TCC CTC TCG CTC err 

CGG GGG 
AGA AGA AGT CTA AGT CTA AGA 

CTG AGG AGG AGC CTG AGC CTG AGG 


15 


20 


30 


TTY «. CAR GAR GAR TTY GGN AAY CAR TTY CAR AAR GCN GAR ACN ATH CCN GTN YTN CAY 
TTT CCT CAA GAA GAA TTT GGT AAT CAA TTT CAA AAA GCT GAA ACT ATT CCT GTT TTA CAT 


CCA 


GGA 


GCA ACA ATA CCA GTA CTT 

GGG GCG ACG CCG GTG CTC 

CTA 
CTG 


1U0 

GU, MET ILE GLN GLN ILE PHE ASN LEU PHE SER THR L VS ASP SER SER flLA ALft TRP flsp 
GAR ATG ATH CAR CAR ATH TTY AAY YTN TTY WSN ACN AAR GAY WSN WSN GCN GCN TGG GAY 


GAA ATG ATT CAA CAA ATT TTT AAT TTA TTT TCT ACT AAA GAT TCT TCT GCT GCT TGG GAT 
ATC CAG CAG ATC TTC AAC TTG TTC TCC ACC AAG GAC TCC TCC GCC GCC GAC 


ATA ATA 

™ ACA TCA TCA GCA GCA 

CTC TCG ACG TC G TCG GCG GCG 


CTA AGT 
CTG AGC 


AGT AGT 
AGC AGC 
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110 120 


GLU 

THR 

LEU 

LEU 

ASP 

LYS 

PHE 

TYR 

THR 

GLU 

LEU 

TYR 

GLN 

GLN 

LEU 

ASN 

ASP 

LEU 

GLU 

ALA 

GAR 

ACN 

YTN 

YTN 

GAY 

AAR 

TTY 

TAY 

ACN 

GAR 

YTN 

TAY 

CAR 

CAR 

YTN 

AAY 

GAY 

YTN 

GAR 

GCN 

GAA 

ACT 

TTA 

TTA 

GAT 

AAA 

TTT 

TAT 

ACT 

GAA 

TTA 

TAT 

CAA 

CAA 

TTA 

AAT 

GAT 

TTA 

GAA 

GCT 

GAG 

ACC 

TTG 

TTG 

GAC 

AAG 

TTC 

TAC 

ACC 

GAG 

TTG 

TAC 

CAG 

CAG 

TTG 

AAC 

GAC 

TTG 

GAG 

GCC 


ACA 

CTT 

CTT 





ACA 


CTT 




CTT 



CTT 


GCA 


ACG 

CTC 

CTC 





ACG 


CTC 




CTC 



CTC 


GCG 



CTA 

CTA 







CTA 




CTA 



CTA 





CTG 

CTG 







CTG 




CTG 



CTG 




130 

CYS VAL ILE GLN GLY VAL GLY VAL THR GLU THR PRO LEU MET LYS GLU ASP SER ILE 
TGY GTN ATH CAR GGN GTN GGN GTN ACN GAR ACN CCN YTN ATG AAR GAR GAY WSN ATH 


15 






















TGT 

GTT 

ATT 

CAA 

GGT 

GTT 

GGT 

GTT 

ACT 

GAA 

ACT 

CCT 

TTA 

ATG 

AAA 

GAA 

GAT 

TCT 

ATT 

TTA 


TGC 

GTC 

ATC 

CAG 

GGC 

GTC 

GGC 

GTC 

ACC 

GAG 

ACC 

CCC 

TTG 


AAG 

GAG 

GAC 

TCC 

ATC 

TTG 



GTA 

ATA 


GGA 

GTA 

GGA 

GTA 

ACA 


ACA 

CCA 

CTT 





TCA 

ATA 

CTT 



GTG 



GGG 

GTG 

GGG 

GTG 

ACG 


ACG 

CCG 

CTC 





TCG 


CTC 

20 













CTA 





AGT 


CTA 














CTG 





AGC 


CTG 











150 










160 


AXA 

VAL 

ARG 

LYS 

TYR 

PHE 

GLN 

ARG 

ILE 

THR 

LEU 

TYR 

LEU 

LYS 

GLU 

LYS 

LYS 

TYR 

SER 

PRO 

25 

GCN 

GTN 

MGN 

AAR 

TAY 

TTY 

CAR 

MGN 

ATH 

ACN 

YTN 

TAY 

YTN 

AAR 

GAR 

AAR 

AAR 

TAY 

WSN 

CCN 


GCT 

GTT 

CGT 

AAA 

TAT 

TTT 

CAA 

CGT 

ATT 

ACT 

TTA 

TAT 

TTA 

AAA 

GAA 

AAA 

AAA 

TAT 

TCT 

CCT 


GCC 

GTC 

CGC 

AAG 

TAC 

TTC 

CAG 

CGC 

ATC 

ACC 

TTG 

TAC 

TTG 

AAG 

GAG 

AAG 

AAG 

TAC 

TCC 

CCC 


GCA 

GTA 

CGA 





CGA 

ATA 

ACA 

CTT 


CTT 






TCA 

CCA 

30 

GCG 

GTG 

CGG 





CGG 


ACG 

CTC 


CTC 






TCG 

CCG 




AGA 





AGA 



CTA 


CTA 






AGT 





AGG 





AGG 



CTG 


CTG 






AGC 
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10 


180 

CYS ALA TRP GLU VAL VAL AUG ALA GLU ILE MET ARC SER PHE SER LEU SER THR ASN LEU 
TGY GCN TGG GAR GTN GTN MGN GCN GAR ATH ATG MGN WSN TTY WSN YTN WSN ACN AAY YTN 

TGT GCT TGG GAA GTT GTT CGT GCT GAA ATT ATG CGT TCT TTT TCT TTA TCT ACT AAT TTA 
TGC GCC GAG GTC GTC CGC GCC GAG ATC CGC TCC TTC TCC TTG TCC ACC AAC TTG 

GCA GTA GTA CGA GCA ATA CGA TCA TCA CTT TCA ACA CTT 

GCG GTG GTG CGG GCG CGG TCG TCG CTC TCG ACG CTC 

AGA AGA AGT AGT CTA AGT CTA 

AGG AGG AGC AGC CTG AGC CTG 


15 


20 


GLN GLU SER LEU ARG SER LYS GLU *** 
CAR GAR WSN YTN MGN WSN AAR GAR TRR 

CAA GAA TCT TTA CGT TCT AAA GAA TAA 
GAG GAG TCC TTG CGC TCC AAG GAG TAG 

TCA CTT CGA TCA TGA 

TCG CTC CGG TCG 

AGT CTA AGA AGT 

AGC CTG AGG AGC 


Delivery and expression of nucleic acids in many 
formulations is limited due to degradation of the nucleic 

25 acids by components of organisms, such as nucleases. Thus, 
protection of the nucleic acids when delivered in vivo can 
greatly enhance the resulting expression, thereby enhancing 
a desired pharmacological or therapeutic effect. It was 
found that certain types of compounds which interact with a 

30 nucleic acid (e.g., DNA) in solution but do not condense the 
nucleic acid provide in vivo protection to the nucleic acid, 
and correspondingly enhance the expression of an encoded 
gene product. 

We have described the use of delivery systems designed 
35 to interact with plasmids and protect plasmids from rapid 
extracellular nuclease degradation [Mumper, R.J., e t al., 
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1996, Pharm. Res. 13:701-709; Mumper, R.J., et al., 1997. 
Submitted to Gene Therapy] . A characteristic of the PINC 
systems is that they are non-condensing systems that allow 
the plasmid to maintain flexibility and diffuse freely 
5 throughout the muscle while being protected from nuclease 
degradation. While the PINC systems are primarily discussed 
below, it will be understood that cationic lipid based 
systems and systems utilizing both PINCS and cationic lipids 
are also within the scope of the present invention. 
10 A common structural component of the PINC systems is 

that they are amphiphilic molecules, having both a 
hydrophilic and a hydrophobic portion. * The hydrophilic 
portion of the PINC is meant to interact with plasmids by 
hydrogen bonding (via hydrogen bond acceptor or donor 
15 groups), Van der Waals interactions, or/and by ionic 
interactions. For example, PVP and N-methyl-2-pyrrolidone 
<NM2P) are hydrogen bond acceptors while PVA and PG are 
hydrogen bond donors . 

All four molecules have been reported to form complexes 
20 with various (poly) anionic molecules [Buhler V., BASF 
Aktiengescellschaft Feinchemie, Ludwigshaf en, pp 39-42; ' 
Galaev Y, et al., J. Chrom. A. 684:45-54 (1994); Tarantino 
R, et al. J. Pharm. Sci. 83:1213-1216 (1994); Zia, H. , et 
al. , Pharm. Res. 8:502-504 (1991);]. The hydrophobic 

25 portion of the PINC systems is designed to result in a 
coating on the plasmid rendering its surface more 
hydrophobic. Kabanov et al. have described previously the 
use of cationic polyvinyl derivatives for plasmid 
condensation designed to increase plasmid hydrophobicity, 
30 protect plasmid from nuclease degradation, and increase its 
affinity for biological membranes [Kabanov, A.V., and 
Kabanov, V.A. , 1995, Bioconj . Chem, 6:7-20; Kabanov, A.V., 
et al., 1991, Biopolymers 31:14 37-1443; Yaroslavov, A. A. , et 
al., 1996, FEBS Letters 384:177-180]. 
35 Substantial protective effect is observed; up to at 

least a one log enhancement of gene expression in rat muscle 
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over plasmid formulated in saline has been demonstrated with 
these exemplary PINC systems. We have also found that the 
expression of reporter genes in muscle using plasmids 
complexed with the PINC systems was more reproducible than 
5 when the plasmid was formulated in saline. For example, the 
coefficient of variation for reporter gene expression in 
muscle using plasmid formulated in saline was 96 + 35% ( n = 
20 studies; 8-12 muscles/study) whereas with coefficient of 
variation with plasmids complexed with PINC systems was 40 + 
10 19% (n = 30 studies; 8-12 muscles/study). The high 

coefficient of variation for reporter gene expression with 
plasmid formulated in saline has been described previously 
[Davis, H.L., et al . , 1993, Hum. Gene Ther. 4:151-9]. In 
addition, in contrast with the results for DNA: saline, there 
15 was no significant difference in gene expression in muscle 
when plasmid with different topologies were complexed with 
polyvinyl pyrrolidone (PVP) . This suggests that PVP is able 
to protect all forms of the plasmid from rapid nuclease 
degradation. 

20 1 ■ Summary of interactions between a PINC 

polymer (PVP) and plasmid 
We have demonstrated using molecular modeling that an 
exemplary PINC polymer, PVP, forms hydrogen bonds with the 
base pairs of a plasmid within its major groove and results 

25 in a hydrophobic surface on the plasmid due to the vinyl 
backbone of PVP. These interactions are supported by the 
modulation of plasmid zeta potential by PVP as well as by 
the inhibition of ethidium bromide intercalation into 
complexed plasmid. We have correlated apparent binding 

30 between PVP and plasmid to pH and salt concentration and 
have demonstrated the effect of these parameters on D-gal 
expression after intramuscular injection of plasmid/ PVP 
complexes [Mumper, R.J., et al., 1997. Submitted to Gene 
Therapy] . a summary of the physico-chemical properties of 

35 plasmid/PVP complexes is listed in Table I below. 
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Table I: Summary of the Physico-chemical Properties of 

Plasmid/PVP Complexes 


Method 

Molecular modeling 
Fourier- transformed 
Infra-red Hydrogen 
DNase I challenge 


Microtitration 
Calorimetry 

Potentiometric titration 
Dynamic Dialysis 


Zeta potential 
modulation 
Ethidium bromide 
Intercalation 

Osmotic pressure 

Luminescence 
Spectroscopy 


Result 

Hydrogen bonding and 
hydrophobic plasmid surface 
observed bonding demonstrated 
Decreased rate of plasmid 
degradation in the presence 
of PVP 

Positive heats of reaction 
indicative of-- an endothermic 
process 

One unit pH drop when plasmid 
and PVP are complexed 
Rate of diffusion of PVP 
reduced in the presence of 
plasmid 

Surface charge of plasmid 
decreased by PVP 
Ethidium bromide 
intercalation reduced by 
plasmid/PVP complexation 
Hyper-osmotic formulation 
(i.e., 34 0 mOsm/kg H 2 0) 
Plasmid/PVP binding decreased 
in salt and/or at pH 7 


10 


2 • Histology of expression in muscle 
Immunohistochemistry for p-gal using a slide scanning 
technology has revealed the uniform distribution of p-gal 
expression sites across the whole cross-sections of rat 
tibialis muscles. Very localized areas were stained positive 
for p-gal when CMV-p-gal plasmid was formulated in saline. 
P-gal positive cells were observed exclusively around the 
needle tract when plasmid was injected in saline. This is 
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10 


in agreement with previously published results (Wolff, J. A., 
et al., 1990, Science 247:1465-68; Davis, H.L., et al., 
1993, Hum. Gene Ther. 4:151-9; Davis, H.L., et al., 1993, 
Hum. Gene Ther. 4:733-40]. 

In comparison, immunoreactivity for P-gal was observed 
in a wide area of muscle tissue after intramuscular 
injection of CMV-0-gal plasmid/PVP complex (1:17 w/w) in 150 
mM NaCl. It appeared that the majority of positive muscle 
fibers were located at the edge of muscle bundles. Thus, 
staining for P~gal in rat muscle demonstrated that, using a 
plasmid/PVP complex, the number of muscle fibers stained 
positive for p-gal was approximately 8-fold greater than 
found using a saline formulation. Positively stained 

muscle fibers were also observed over a much larger area in 
15 the muscle tissue using the plasmid/PVP complex providing 
evidence that the injected plasmid was widely dispersed 
after intramuscular injection. 

We conclude that the enhanced plasmid distribution and 
expression in rat skeletal muscle was a result of both 
protection from extracellular nuclease degradation due to 
complexation and hyper-osmotic effects of the plasmid/PVP 
complex. However, Dowty and Wolff et al. have demonstrated 
that osmolarity, up to twice physiologic osmolarity, did not 
significantly effect gene expression in muscle [Dowty, M.E., 
25 and Wolff, J. A. In: J. A. Wolff (Ed.), 1994, Gene 

Therapeutics; Methods and Applications of Direct Gene 
Transfer. Birkhauser, Boston, pp. 82-98] . This suggests 
that the enhanced expression of plasmid due to PVP 
complexation is most likely due to nuclease protection and 
30 less to osmotic effects. Further, the surface modification 
of plasmids by PVP (e.g., increased hydrophobicity and 
decreased negative surface charge) may also facilitate the 
uptake of plasmids by muscle cells. 


20 
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3 . Structure-activity relationship of PINC 
polymers 

We have found a linear relationship between the 
structure of a series of co-polymers of vinyl pyrrolidon'e 
5 and vinyl acetate and the levels of gene expression in rat 
muscle. We have found that the substitution of some vinyl 
pyrrolidone monomers with vinyl acetate monomers in PVP 
resulted in a co-polymer with reduced ability to form 
hydrogen bonds with plasmids. The reduced interaction 
10 subsequently led to decreased levels of gene expression in 
rat muscle after intramuscular injection. The expression of 
p-gal decreased linearly (R - 0.97) as the extent of vinyl 
pyrrolidone monomer (VPM) content in the co-polymers 
decreased. 

15 These data demonstrate that pH and viscosity are not 

the most important parameters effecting delivery of plasmid 
to muscle cells since these values were equivalent for all 
complexes. These data suggest that enhanced binding of the 
PINC polymers to plasmid results in increased protection "and 

20 bioavailability of plasmid in muscle. 

4 . Additional PINC systems 

The structure-activity relationship described above can 
be used to design novel co-polymers that will also have 
enhanced interaction with plasmids. It is expected that 

25 there is "an interactive window of opportunity" whereby 
enhanced binding affinity of the PINC systems will result in 
a further enhancement of gene expression after their 
intramuscular injection due to more extensive protection of 
plasmids from nuclease degradation. It is expected that 

30 there will be an optimal interaction beyond which either 
condensation of plasmids will occur or "triplex" type 
formation, either of which can result in decreased 
bioavailability in muscle and consequently reduced gene 
expression . 
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15 


As indicated above th<=> ptmp ~ 
- mrih . e ' cne PINC compounds are generally 

Edltio ; ° a r rtr: of r mistry and physJs — 

10 pyrazoles im i d ,' ' includ ing Pyrroles, 

Hy-t-ctzoj.es, imidazoles, triazol w^u • ^ 

' Pyrones, dioxins, Dvrid-iri^e: ~ 
pyrimidines, pyrazines DiT »„ ■ Y Pyndazmes, 

groups, hereby incorporated by reference ° f theSe 

case C o 7 P ° Und l S alS ° C ° ntain hydrophobic croups which, in the 

acrylates, acrylamides, esters „, , et "yls, 

hydrides esters, celluloses, amides, 

ZalT ' Carbonates ' Phosphazenes, sulfones 

propylenes, and derivatives of these croups. The polar"; 

discussions of po r ari r y aS in lllUSt " ted ' exa.ple, by 

chenUstry textbook ' organic 

acidsTs alto'" f ^ m ° leCUleS ^ ±nte " Ct With ^leic 
acids is also understood by those skilled in the art and 

:u\v\:;;i c r r the use of °— <•* ™ 

addition lnteraCti0nS " Alternatively or in 

d " USln9 °" e ° r more of such tests as if 

determination of inhibition of the rate J f 

^estion 2) alteration of the zeta ; ot ; n a - al °: f :t d : s ; 

which indicates coating of DNA 3) nr ■ u-w- • 

y 3 ) or inhibition of the 
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ability of intercalating agents, such as ethidium bromide to 
intercalate with DNA. 

5 . Targeting Ligands 
In addition to the nucleic acid/PINC complexes 
5 described above for delivery and expression of nucleic acid 
sequences, in particular embodiments it is also useful to 
provide a targeting ligand in order to preferentially obtain 
expression in particular tissues, cells, or cellular regions 
or compartments. 

10 Such a targeted PINC complex includes a PINC system 

(monomeric or polymeric PINC compound) complexed to plasmid 
(or other nucleic acid molecule). The PINC system is 
covalently or non-covalently attached to (bound to) a 
targeting ligand (TL) which binds to receptors having an 

15 affinity for the ligand. Such receptors may be on the 
surface or within compartments of a cell. Such targeting 
provides enhanced uptake or intracellular trafficking of the 
nucleic acid. 

The targeting ligand may include, but is not limited 

20 to, galactosyl residues, fucosal residues, mannosyl 
residues, carnitine derivatives, monoclonal antibodies, 
polyclonal antibodies, peptide ligands, and DNA-binding 
proteins. Examples of cells which may usefully be targeted 
include, but are not limited to, antigen-presenting cells, 

25 hepatocytes, myocytes, epithelial cells, endothelial cells, 
and cancer cells. 

Formation of such a targeted complex is illustrated by 
the following example of covalently attached targeting 
ligand (TL) to PINC system: 

30 TL-PINC + Plasmid > TL-PINC :::::: Plasmid 

Formation of such a targeted complex is also 
illustrated by the following example of non-covalently 
attached targeting ligand (TL) to PINC system 
TL: : : : : : PINC + Plasmid > TL: : : : : : PINC: : : : : : Plasmid 

35 or alternatively, 
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PINC + Pl asmid > PINC- . Plasmid +TL ___ 

> TL: :::::PINC:::::::Plasmid 
In these examples is non-covalent interaction such 

as ionic, hydrogen-bonding, Van der Waals interaction 
5 hydrophobic interaction, or combinations of such 
interactions . 

A targeting method for cytotoxic agents is described in 
Subramanian et al., International Application No 
PCT/US96/08852, International Publication No. WO 96/3912^' 
hereby incorporated by reference. This application 

describes the use of poller affinity systems for targeting 
cytotoxic materials using a two-step ■ targeting method 

n: n b 9 ; iP P ° lymerS ' Pai " ° f ^eracting polymers. 

An antibody attached to one of the interacting polymers 

" lndS t0 * Cellul - t. W . Th at polymer then acts 7s a 
target for a second polymer attached to a cytotoxic agent 
As referenced in Subramanian et al., other two-step (or 
multi-step) systems for delivery of toxic agents are also 
described . 

in another aspect, nucleic acid coding sequences can be 
delivered and expressed using a two-step targeting approach 
involving a non-natural target for a PINC system or PINC- 
targeting ligand complex. Thus, for example, a PXNC-plasmid 
complex can target a binding pair member which is itself 
attached to a ligand which binds to a cellular target (e g 
a MAB). Binding pairs for certain Qf ^ ^ 

identified herein as PINC compounds as identified in 
Subramanian et al. Alternatively, the PINC can be complexed 
to a tareting ligand, such as an antibody. That antibody 
can be targeted to a non-natural target which binds to, for 
example, a second antibody. 


20 


25 


30 


IIT " Mod *l Systems for Eval„ a M on of mterf^ 
Constr ucts and Formulations 

In accord with the concept of using interferon alpha 
35 expressing plasmid constructs and formulations in anti- 
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cancer treatment, murine model systems were utilized based 
on murine tumor cell lines. The line primarily used was 
S.C. VII/SF, which is a cell line derived from murine 
squamous cell carcinoma (S.C). 
5 Squamous cell carcinoma of the head and neck begins 

with the cells lining the oral and pharyngeal cavities. 
Clinical disease progresses via infiltration and spreads 
into the underlying tissues and lymphatics. The 
undifferentiated, in vivo passage tumor line S.C. VII/SF 

10 displays this typical growth pattern. In addition, its 
rapid growth rate provides a relatively short test period 
for individual experiments. Other murine-" tumor cell lines 
include another SCC line KLN-205, a keratinocyte line 1-7, 
and a colon adenocarcinoma line MC-38. 

15 An optimal model system preferably satisfies the 

criteria based on having tumor growth rate in vivo (i.e., 
tumors are ready for treatment in 4-10 days post implant) , 
invasiveness, and local spread similar to those observed in 
clinical disease, and providing accessibility for 

20 experimental treatment. As indicated, the SCC VII/SF cell 
line was utilized as the primary model system cell line. 
This cell line typically grows rapidly, resulting in death 
of untreated syngeneic mice 14-17 days after tumor cell 
implantation. 

25 This cell line can be utilized in a variety of ways to 

provide model system suitable for a variety of different 
tests. Four such possibilities are described below. 

First, SCCVII cells can be utilized in cell culture to 
provide an in vitro evaluation of interferon alpha 

30 expression construct and formulation characteristics, such 
as expression levels and cellular toxicities. 

Second, the cells can be implanted subcutaneously in 
mice. This system can be utilized in tests in which 
accessibility of the implant site is beneficial. As an 

35 example, the method was utilized in evaluations of 
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expression of 
ranscutaneously into 


expression efficiencies based on the 
chloramphenicol acetyltransf erase (CAT) 

Third, the cells can be implanted t 
the fascia of digastric muscle 

5 into d° Urth ' C€llS ^ ^ imPl3nted transcutaneous^ 

into d.grastrxc/mylohyoid muscles. The important features 
of models 3 and 4 are shown in the table below 


TABLE II; Comparison of submandihni 


ar tumor models 


Feature 


Tumor implant 
procedure 


Tumor growth and 

invasiveness 

characteristics 


Mouse Tumor Model 3 


Treatment 
procedure 
(primary 
treatment) 


Days treated 
(post-implant) 


Measurement 
procedure 


2-4 x 10 s cells 
transcutaneously into 
fascia of digastric 
muscle 
Prominent 
submandibular bulge; 
invasion of 

digastric/mylohyoid 
muscles and 
lymphatics 


Transcutaneous, 
needle inserted and 
moved within tumor to 
produce a 4 quadrant 
distribution of gene 
medicine 


Mouse Tumor Model 4 
5 x 10 s 

transcutaneously into 
digastric/mylohyoid 
muscles 

More variable 
invasion of 
digastric/mylohyoid 
muscles and lymphatics 


Day 5, day 10 (both 
transcutaneously) 


Lower jaw skin flap 
raised to expose 
tumor, needle inserted 
and moved within tumor 
to produce a 4 
quadrant distribution 
of gene medicine 


External calipering 
2-3 x per week until 
death 


Day 5 (tumor exposed), 
day 8 

(transcutaneously) 
First caliper when 
tumor exposed for 
treatment, second 
caliper at sacrifice 
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Feature 

Mouse Tumor Model 3 

Mouse Tumor Model 4 

Advantages 

Non-surgical, closed 
model allows larger 
experiments and more 
frequent treatments; 
oauiiiicc unnecessary 
to caliper (=more 
time points) 

Surgical, open model 
allows direct 
treatment of exposed 
tumor; Local 
i n fl anuria t x on from 
surgery may 

additionally stimulate 
immune response; More 
like clinical 
situation for protocol 
development 

Disadvantages 

Transcutaneous 
treatment is 
potentially less 
accurate and 
intensive; less like 
expected clinical 
treatments than 
surgical approaches 

Labor intensive- 
Smaller, fewer 
experiments possible; - 
Tumors deeper and more 
difficult to treat 
transcutaneously (for 
secondary treatments) ; 
Fewer treatments and 
caliperings possible 


The tumor size treated in the mouse models is. generally 
20-50 mm 3 . A 50 mm 3 mouse tumor is approximately equivalent 
to 150 cc 3 human tumor having an average diameter of about 
6.6 cm. This tumor size is approximately 10-fold larger 


5 than the size proposed to be treated in the phase I clinical 
trials. This indicates that the mouse models are strongly 
biased towards over estimating the expected tumor burden in 
human patients. 

IV. Formulations for In Vivo Delivery 
10 A. General 

While expression systems such as those described above 
provide the potential for expression when delivered to an 
appropriate location, it is beneficial to provide the 
expression system construct (s) in a delivery system which 
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can assist both the delivery and the cellular uptake of the 
construct. Thus, this invention also provides particular 
formulations which include one or more expression system 
constructs (e.g., DMA plasmids as described above), and a 
5 protective, interactive non-condensing compound. 

An additional significant factor relating to the 
plasmid construct is the percentage of plasmids which are in 
a supercoiled (SC> form rather than the open circular (OC) 
form. 

10 B - Delivery and Expression 

A variety of delivery methods can -be used with the 
constructs and formulations described above, in particular 
delivery by injection to the site of a tumor can be used' 
The submandibular tumor models utilized injection into four 

15 quadrants of the tumor being treated. 

C - ft"ti-cancer Efficacy of Human Interferon Aloh* 
Formulations 

The effects of the administration of the interferon 
alpha formulations described above were evaluated using the 
S.C. VII mouse tumor models. Plasmid constructs as 

described above were incorporated in delivery formulations 
The formulations were delivered by injection. 

°- Synergistic Effects of i nterferon Alpha plasmid 
and IL-12 Plasmid and Effect of Human Interferon 


25 


30 


Alpha Formulation Admi nistration on Production »f 
Secondary Cytokines 
The effects of the expression of the human interferon 
alpha plasmids in tumor cells on the progress of the mouse 
tumors demonstrates that such interferon alpha expression is 
effective against such tumors. However, it was also shown 
that IL-12 can act synergistically with the interferon alpha 
expression to exercise the antitumor effect (see Figure 9) 
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E. Toxicity Evaluation of Exemplary Formulations 
The exemplary formulations do not show high cellular 
toxicity at the concentrations tested, suggesting that the 
formulations do not significantly kill cells by direct toxic 
action in vivo. Moreover, the anti-tumor activity induced 
by IFNa gene therapy is dependent upon activation of the 
immune system, which is demonstrated by depletion studies in 
vivo. Removal of a specific T lymphocyte population (CD8 + ) 
abrogates the anti-tumor activity elicited by IFNa gene 
therapy. 

V. Administration 

Administration as used herein refers to the route of 
introduction of a plasmid or carrier of DNA into the body. 
In addition to the methods of delivery described above, the 
expression systems constructs and the delivery system 
formulations can be administered by a variety of different 
methods . 

Administration can be directly to a target tissue or by 
targeted delivery to the target tissue after systemic 
administration. In particular, the present invention can be 
used for treating disease by administration of the 
expression system or formulation to the body in order to 
establishing controlled expression of any specific nucleic 
acid sequence within tissues at certain levels that are 
useful for gene therapy. 

The preferred means for administration of vector 
(plasmid) and use of formulations for delivery are described 
above. The preferred embodiments are by direct injection 
using needle injection. 

The route of administration of any selected vector 
construct will depend on the particular use for the expres- 
sion vectors. In general, a specific formulation for each 
vector construct used will focus on vector uptake with 
regard to the particular targeted tissue, followed by 
demonstration of efficacy. Uptake studies will include 
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uptake assays to evaluate cellular uptake of the vectors and 
expression of the DNA of choice. Such assays will also 
determine the localization of the target DNA after uptake 
and establishing the requirements for maintenance of steady- 
5 state concentrations of expressed protein. Efficacy and 
cytotoxicity can then be tested. Toxicity will not only 
include cell viability but also cell function. 

Muscle cells have the unique ability to take up DNA 
from the extracellular space after simple injection of DNA 
10 particles as a solution, suspension, or colloid into the 
muscle. Expression of DNA by this method can be sustained 
for several months. 

Delivery of formulated DNA vectors involves 
incorporating DNA into macromolecular complexes that undergo 
15 endocytosis by the target cell. Such complexes may include 
lipids, proteins, carbohydrates, synthetic organic 
compounds, or inorganic compounds. Preferably, the complex 
includes DNA, a cationic lipid, and a neutral lipid in 
particular proportions. The characteristics of the complex 
20 formed with the vector (size, charge, surface character- 
istics, composition) determines the bioavailability of • the 
vector within the body. other elements of the formulation 
function as ligand which interact with specific receptors on 
the surface or interior of the cell. other elements of the 
formulation function to enhance entry into the cell, release 
from the endosome, and entry into the nucleus. 

Delivery can also be through use of DNA transporters. 
DNA transporters refers to molecules which bind to DNA 
vectors and are capable of being taken up by epidermal 
cells. DNA transporters contain a molecular complex capable 
of noncovalently binding to DNA and efficiently transporting 
the DNA through the cell membrane. It is preferable that 
the transporter also transport the DNA through the nuclear 
membrane. See, e.g., the following applications all of 
35 which (including drawings) are hereby incorporated by 
reference herein: (1) Woo e t al., U.S. Serial No 
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07/855,389, entitled "A DNA Transporter System and Method of 
Use,, filed March 20, 1992, now abandoned; (2) Woo et al., 
PCT/US93/02725, International Publ . W093/18759, entitled "A 
DNA Transporter System and Method of Use", (designating the 
U.S. and other countries) filed March 19, 1993; (3) 
continuation-in-part application by Woo et al., entitled 
"Nucleic Acid Transporter Systems and Methods of Use", filed 
December 14, 1993, U.S. Serial No. 08/167,641; (4) Szoka et 
al. , U.S. Serial No. 07/913,669, entitled "Self-Assembling 
Polynucleotide Delivery System", filed July 14, 1992 and (5) 
Szoka et al . , PCT/US93/03406, International Publ . W093/19768 
entitled "Self -Assembling Polynucleotide -Delivery System", 
(designating the U.S. and other countries) filed April 5, 
1993. A DNA transporter system can consist of particles 
containing several elements that are independently and non- 
covalently bound to DNA. Each element consists of a ligand 
which recognizes specific receptors or other functional 
groups such as a protein complexed with a cat ionic group 
that binds to DNA. Examples of cations which may be used 
are spermine, spermine derivatives, histone, catipnic 
peptides and/or polylysine. One element is capable of 
binding both to the DNA vector and to a cell surface 
receptor on the target cell. Examples of such elements are 
organic compounds which interact with the asialoglycoprotein 
receptor, the folate receptor, the mannose-6-phosphate 
receptor, or the carnitine receptor. A second element is 
capable of binding both to the DNA vector and to a receptor 
on the nuclear membrane. The nuclear ligand is capable of 
recognizing and transporting a transporter system through a 
nuclear membrane. An example of such ligand is the nuclear 
targeting sequence from SV40 large T antigen or histone. A 
third element is capable of binding to both the DNA vector 
and to elements which induce episomal lysis. Examples 
include inactivated virus particles such as adenovirus, 
peptides related to influenza virus hemagglutinin, or the 
GALA peptide described in the Szoka patent cited above. 
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10 


35 


Transfer of genes directly into a tumor has been very 
effective. Experiments show that administration by direct 
injection of DNA into tumor cells results in expression of 
the gene in the area of injection. Injection of plasmids 
containing human interferon alpha results in expression of 
the gene for 5 days following a single intra-tumoral 
injection. Human IFNa production was highest in tumors 
harvested 1 day post-tumor injection and steadily declined 
thereafter. The injected DNA appears to persist in an 
unintegrated extrachromosomal state. This means of transfer 
is a preferred embodiment. 

Administration may also involve lipids as described in 
preferred embodiments above. The lipids may form liposomes 
which are hollow spherical vesicles composed of lipids 
15 arranged in unilamellar, bilamellar, or multilamellar 
fashion and an internal aqueous space for entrapping water 
soluble compounds, such as DNA, ranging in size from 0.05 to 
several microns in diameter. Lipids may be useful without 
forming liposomes. Specific examples include the use of 
cationic lipids and complexes containing DOPE which interact 
with DNA and with the membrane of the target cell to 
facilitate entry of DNA into the cell. 

Gene delivery can also be performed by transplanting 
genetically engineered cells. For example, immature muscle 
cells called myoblasts may be used to carry genes into the 
muscle fibers. Myoblast genetically engineered to express 
recombinant human growth hormone can secrete the growth 
hormone into the animal's blood. Secretion of the incorpor- 
ated gene can be sustained over periods up to 3 months. 

Myoblasts eventually differentiate and fuse to existing 
muscle tissue. Because the cell is incorporated into an 
existing structure, it is not just tolerated but nurtured. 
Myoblasts can easily be obtained by taking muscle tissue 
from an individual who needs gene therapy and the 
genetically engineered cells can also be easily put back 
with out causing damage to the patient's muscle. Similarly, 
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keratinocytes may be used to delivery genes to tissues. 
Large numbers of keratinocytes can be generated by 
cultivation of a small biopsy. The cultures can be prepared 
as stratified sheets and when grafted to humans, generate 
5 epidermis which continues to improve in histotypic quality 
over many years. The keratinocytes are genetically 

engineered while in culture by transfecting the 
keratinocytes with the appropriate vector. Although 
keratinocytes are separated from the circulation by the 
10 basement membrane dividing the epidermis from the dermis, 
human keratinocytes secrete into circulation the protein 
produced. 

The chosen method of delivery should result in 
expression of the gene product encoded within the nucleic 

15 acid cassette at levels which exert an appropriate 
biological effect. The rate of expression will depend upon 
the disease, the pharmacokinetics of the vector and gene 
product, and the route of administration, but should be in 
the range 0.001-100 mg/kg of body weight /day, and 

20 preferably 0.01-10 mg/kg of body weight/day. This level is 
readily determinable by standard methods. It could be more 
or less depending on the optimal dosing. The duration of 
treatment will extend through the course of the disease 
symptoms, possibly continuously. The number of doses will 

25 depend upon the disease, delivery vehicle, and efficacy data 
from clinical trials. 

Examples 

The present invention will be more fully described in 
conjunction with the following specific examples which are 

30 not to be construed in any way as limiting the scope of the 
invention. As shown below, mIFN- gene medicine reduces the 
growth of tumors in syngeneic murine tumor models. Lipid 
formulations of mIFN- gene medicine display anti tumor 
activity in both SCC-VTI and MC-38 tumor models. PINC and 

35 peptide formulations of ml FN- gene medicine display anti 
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tumor effects in the MC-38 tumor model. The anti tumor 
effects of ml FN- gene medicine are dose dependent. m 
addition, the examples demonstrate that treatment of tumors 
with the combination of IFNa and IL-12 gives an 
5 unanticipated more than additive (synergystic) anti-tumor 
activity using either a PINC or a lipid formulation. 

Example 1 

A plasmid expression system encoding murine IFNa4 and 
formulated in a polymeric delivery system was used for in 

10 vivo immunotherapeutic activity aaain^t 

- Ll - y against an immunogenic 
murine renal cell carcinoma, Renca, and - a non-immunogenic 
mammary adenocarcinoma, TS/A. Mice bearing established 
tumors were treated with IFNa/polyvinyl-pyrrolidone (PVP) 
expression compiexes via direct intra-tumoral injection. Op 
to 100 % tumor growth inhibition was observed in the treated 
mice. By using an optimal dose of 96 and 4 8 ug of 
formulated I FN— a plasmid for the treatment of Renca and TS/A 
respectively, 30% (Renca) and 10% (TS/A) of the treated 
animals remained tumor-free. Tumor inhibition was dependent 
20 upon activation of the immune system. The anti-tumor 
activity elicited by IFN-a gene therapy was abrogated when 
mice were selectively depleted of CD8 + T cells. By contrast 
removal of CD4* resulted in increased tumor rejection 
following IFN-a/PVP treatments. Finally, mice that remained 
25 tumor-free following I FN— a gene therapy displayed immune 
resistance to a subsequent challenge of tumor. These data 
provide evidence that non-viral IFNa gene therapy can be 
used to induce an efficient anti-tumor response. 

Local presence of cytokines in tumors can activate an 
30 immune response that in some cases leads to induction of 
specific long-lasting anti-tumor immunity. By direct intra- 
tumoral injection of plasmid encoding murine IFNa 4 and 
formulated in a polymeric delivery system, tumor-bearing 
mice develop an immune response, which leads to inhibition 
35 and eradication of the tumor. We have shown by depletion 
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studies in vivo that the immune response induced by IFNa is 
mainly CD8-mediated and that this treatment results in a 
long-term immunity in mice demonstrating complete tumor 
regression. Thus, non-viral IFNa gene therapy may be an 
5 effective alternative to IFNa protein therapy for human 
cancers . 

Transduction of tumor cells with cytokine genes has 
proven to be a very efficient technique to induce cytokine- 
mediated anti-tumor immunity. In experimental models, the 

10 local presence of IL-2, IL-1, IL-4, IL-6, IL-7, IL-12, IFNs 
and CSFs (i.e., GM-CSF) at the site of the tumor can result 
in significant tumor growth inhibition- (Colombo et al., 
"Local Cytokine Availability Elicits Tumor Rejection and 
Systemic Immunity Through Granulocyte-T-Lymphocyte Cross- 

15 Talk", Cancer Research , 52, 4853-4857 (1992)). In these 
systems, cytokines have limited effect on tumor 
proliferation directly but are capable of activating a rapid 
and potent ant i -tumor immune response, which impedes tumor 
progression. Established parental tumors, however, are 

20 difficult to eradicate with ex vivo cytokine-transduced 
tumor cells because efficacy of vaccination is highly 
dependent on the size, growth rate and invasiveness of the 
tumor . 

To overcome these problems, cytokine-based gene therapy 
25 approaches, which can deliver transgenic cytokines locally 
and induce an anti-tumor immune response, have been recently 
evaluated by a number of investigators (Forni et al., 
"Cytokine-Induced Immunogenicity : From Exogenous Cytokines 
to Gene Therapy", Journal of Immunotherapy , 14, 253-257, 
30 (1993); Pericle et al . , "An Efficient Th2-type Memory 
Follows Cd8+ Lymphocyte-driven and Eosinophil-mediated 
Rejection of a Spontaneous Mouse Mamary Adenocarcicoma 
Engineered to Release 11-4", The Journal of Immunology , 153, 
5660-5673. (1994); Pardoll et al., "Gene Modified Tumor 
35 Vaccines, In Cytokine-Induced Tumor Immunogenicity", eds . 
Academic Press , London, p. 71-86. (1994); and Musiani et 
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al., "Cytokines, Tumor-cell Death and Immunogenicity • A 
Question of Choice", Immunology Tod.y . i, 32 -36 (1997)) 
Technological breakthroughs in gene therapy using 
adenoviral, retroviral, and liposomal vectors have provided 
powerful tools with which to study the biological effects of 
specific cytokine mediators as well as to develop novel and 
clinrcally applicable anti-tumor immunotherapies (Pardoll 
"Paracrine Cytokine Adjuvants in Cancer Immunotherapy",' 
Annu. Rev. Immunol . 13, 399-415 (1995); Bramson et al 
"Direct Intratumoral Injection of an Adenovirus Expressing 
lnterleukin-12 Induces Regression and Long-lasting Immunity 
That is Associated with Highly Localized Expression of 
lnterleukin-12". Hum. Gene Thar.. 7, 1995-2002 (1996); Rao 

T a1 ".' " I1_12 13 a " Eff6Ctive Adjuvant to Recombinant 
15 vaccinia Virus-based Tumor Vaccines", J. Immunol. i 56 
3357-3365. 1996; Rakhmilevich et al., "Gene Gun-mediated 
Skin Transfection with Interleukin 12 Gene Results in 
Regression of Established Primary and Metastatic Murine 
Tumors", Proc. Natl. Acad. Sci . ns» 9 3, 6291-6296 (1996) • 
20 and Rakhmilevich et al, "Cytokine Gene Therapy of Cancer 
Using Gene Gun Technology: Superior Antitumor Activity of 
lnterleukin-12". Hum. Gene Ther . 8, 1303-1311, (1997)). 

A gene therapy approach utilizing an interactive 
polymeric gene delivery system that i ncrease s protein 
expression by protecting plasmid DNA ( pDNA ) from nucleases 
and controlling the dispersion and retention of pDNA in 
muscle cells is described in Mumper et al., 1996. These 
polymeric interactive non-condensing (PiNC) systems 
routinely result in a greater amount of gene expression from 
tissues as compared to delivery of unformulated plasmid in 
saline (Mumper et al., 1996). By using a plasmid that 
encodes human insulin growth factor-1 (hIGF-1) an d 
formulated as a PINC complex, production of biologically 
active h IGF-1 in vivo following intramuscular injection 
35 has been shown (Alila et al., "Expression of Biologically 
Active Human Insulin-Like Growth Factor-1 Following 
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Intramuscular Injection of a Formlated Plasmid in Rats", 
Human Gene Therapy , 8, 1785-1795 (1997)). The specific 
objective of this study was to determine whether a plasmid 
expression system encoding murine IFNa4 and formulated as a 
5 complex with PVP could induce an anti-tumor immune response 
following direct injection into subcutaneous murine tumors. 

The I FN family consists of three major glycoproteins, 
IFNa, IFNP and IFNy . Although IFNs were first developed as 
antiviral agents, it is now clear that they also control 

10 cell growth and differentiation, and modulate various 
aspects of host immunity (Gresser et al., "Antitumor effects 
of interferon", Acta Oncol . 28, 347-353 --(1989)). Clinical 
data concluded that systemic chronic administration of IFNa 
could produce regression of vascular tumors, including 

15 Kaposi's sarcoma, pulmonary hemangioraastosis , and 
hemangiomas (Singh et al., "Interferons A and B Down- 
regulate the Expression of Basic Fibroblast Growth Factor in 
Human Carcicomas", Proc. Natl. Acad. Sci . USA . 92, 4562-4566 
(1995) ) . Although IFNa was the first cytokine to be used in 

20 clinical trials that proved to be effective against certain 
types of human cancer, only recently has this cytokine been 
considered as a candidate for gene therapy (Ogura et al. 
1993, Belldegrun et al., "Human Renal Carcinoma Line 
Transfected With Interleukin-2 and/or Interferon a Gene(s): 

25 Implications for Live Cancer Vaccines, Journal of the 
National Cancer Institute , 85, 207-216 (1993) . 

Initial studies have shown that the injection of 
genetically modified tumor cells producing IFNa into 
syngeneic mice induces tumor growth inhibition and elicits a 

30 tumor-specific immune memory (Ferrantini et al., Interferon 
Alpha-l-Interferon Gene Transfer into Metastatic Friend 
Lukemia Cells Abrogated Tumorigenicity in Immunocompetent 
Mice: Antitumor Therapy by Means of Inter f eron-Producing 
Cells; Cancer Res . 53, 1107-4615 (1993); Ferrantini et al., 

35 "Ifn-al Gene Expression into a Metastatic Murine 
Adenocarcicoma (Ts/a) Results in Cd8+ T Cell-Mediated Tumor 
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Studies ,K DeVel ° Pment ° f A ^i tUm or Xmmunitv: Comparative 

Studxes wxth Ifn-y-producing Ts/a Cells „ Jour * * 
Xmmunology, 153 , 460,-4615, (1994); Musiani aJ f 

However, the real value of this potential form of va'ccxne In 

zzzri — ° f est — — - - 

In this study we present evidence that direct in-io <- ■ 

- IFN « pias^d formulated in PVP into subc ^r: 

tumors results i n a host -dependent tumor reaction 
PnMrlly mediated *y ™+ » cells, and elicits a protect^ 
immunity against subsequent tumor re-challenge. 

Material s And Method* 

Plasmid const r., c ti on and f ormulaticin 

A pl asm id expression system containing an expression 
cassette for mI FN - la4 was constructed as ^ 

z;;; e of the murine ira -° 4 bank x 0 i 973 M1545 6 M238 3o 

X0 19 67, was amplified by pcR £rom genom . c ^ t ^ 

am P iifi ed mIFN _ a4 sequence then subcioned ^ ^^^^ 

a 7 0ne ' SeqUenCe fidelit V «■ verified by DNA seguence 

analyse (data not shown) ^ ^ ^ 

then subclone. as an Xhal-BamtUfr.gment iato the expression 
Pl.«xd PXL06.7 to create the mIFN - a4 expression System pIF0836 
Pl.«xd PVC0612 ( e m p ty pl asmid , EP) contains expressio „ ^ 

2 5 Z p U oT! A1 the — — * earl, promoter and the L 

LcT h S19nal fr ° m b ° Vine gr ° Wth in the PVC028 9 

hac kbone described by ftlila ^ ^ (i997> piasmid P 

L i" * contr ? piasmid in ai1 in vi ™ — 

mtra-tumoral injection were grown under kanamvcin selection 
» E. coli host strains DH5a a „d purified using conventional 

util^e" T 3 Chr0n,at0g " PhiC -thod,. Purified P l asmid 

u tl l lzed for xntra-tumoral injections had the following 
speculations: endotoxin ,< 500 Eu/mg p las mid, , protein « 1%) 
and chromosomal DN A « 20 %) . Purified pIF0836 ^ J 
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plasmids were formulated at a concentration of 3 mg DNA/ml in a 

solution of 5 % w/v polyvinyl-pyrrolidone (Plasdone C-30, ISP 

Technologies, Wayne, NJ) , 150 mM NaCl on the day of injection, as 
described previously (Mumper et al., 1996). 

5 Western blot analysis and bioassay for mIFNa . 

HeLa cells were plated in 6 well plates at 3 x 10 6 cells 
per well, and transfected using 1 ajig of mouse IFNa4 plasmid 
pIF0836C and 3 u.g of Lipof ectamine (Life Technologies, Inc., 
Gaithersburg, MD) in serum-free DMEM. Same supernatants 

10 were harvested 24 hours later and immunoprecipitated using 
anti-mouse interferon a/p polyclonal antibody (BioSource 
International, Camarillo, CA) and protein A and G agarose 
(Boehringer Mannheim, Indianapolis, IN) . Samples were run 
on a 12% Tris-glycine gel and electroblotted to Millipore 

15 PVDF membrane. Anti-mouse interferon ot/p polyclonal antibody 
was used at 1:1000, followed by anti-sheep Ig HRP 
(Boehringer Mannheim) at 1:1000. Biotinylated molecular 
weight markers were detected using Streptavidin-HRP 
(Amersham, Arlington Heights, IL) . Detection was performed 

20 using the Amersham ECL kit. Supernatants were also tested 
for IFNct biological activity using L929 cells treated with 
encephalomyocarditis virus, in parallel with a NIH mouse 
IFNa reference reagent (Access Biomedical, San Diego, CA) . 

Animals 

25 Normal 8-week-old female BALB/c mice were purchased 

from Harlan Laboratories, Houston, TX. Mice were maintained 
on ad libitum rodent feed and water at 23° C, 40% humidity, 
and a 12-h/12-h light-dark cycle. Animals were acclimated 
for at least 4 days before the start of the study. 

30 Tumors 

Three established mouse tumor models were used in this 
study. TS/A is a tumor cell line established by Dr. P. 
Nanni, University of Bologna, Italy, from the first in vivo 
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transplant of a moderately differentiated mammary 
adenocarcinoma that spontaneously arose in a BALB/c mouse 
(Nanni et al . , 1983). A number of pre - immunization _ 
challenge experiments suggested that TS/A does not elicit a 
5 long-lasting anti-tumor immunity (Forni et al., 1987) TS/A 
was generously provided by Dr. Guido Forni, University of 
Turin, Italy. Renca, a spontaneously arising murine renal 
cell carcinoma, and CT-26, a colon adenocarcinoma, were 
generously provided by Dr. Drew M . Pardoll, John Hopkins 
10 Hospital, Baltimore, MD. Tumor cell cultures were 

maintained in sterile disposable flasks from Corning 
(Corning, NY, at 37° c in a humidified 5% CO, atmosphere 
using RPMI 1640 supplemented with 10% FBS, 100 U/ml 
penicillin, 100 U/ml streptomycin and 50 ug/ m l gentamycin; 
15 all from Life Technologies. 

In vivo evaluation of tum or growth and treatments 

BALB/c mice were challenged s.c. in the middle of the 
left flank with 30 ul of a single-cell suspension contained 
the specified number of cells. Seven days later when the 
20 tumor size reached approximately 10 mm 3 , treatments with 
IFNa/PVP or EP/PVP started and were repeated at 1-2 day 
intervals for 2 weeks (total of 8 treatments: 4/week) 
Tumor volume was measured with electronic caliper in the two 
perpendicular diameters and in the depth. Measurements of 
25 the tumor masses (mm 3 ) were performed twice a week for 40-50 
days. All mice bearing tumor masses exceeding 1 cm3 volume 
were sacrificed for humane reasons. When depletion of 
immunocompetent cells in vivo was required, a group of mice 
received i.v 0.5 ml of a-CD4 (GK1.5 hybridoma, 207-TIB 
30 ATCC, Rockville, MD) ascite (1:10), or cc-CDS (2 43 
hybridoma, 210-TIB, ATCC) ascite (1:100) or i.p. l 00 ug 
a-GRl (RB6-8C5 hybridoma, Pharmingen, San Diego, C A) 
Control mice received i.v. 0.5 ml isotype control I gG 
(Pharmingen). Antibody treatments were performed twice- 
35 first injection 1 day before starting the gene therapy 
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treatment and the second injection (i.p at the same dosage) 
7 days later. 

CTL assay 

A standard 6-hour 51-chromium ( 51 Cr) -release assay was 
5 performed following 5 days of in vitro effector cell 
stimulation. Single cell suspensions of splenocytes were 
prepared 3 weeks following tumor challenge by mashing the 
spleens in RPMI 1640 medium (Life Technologies) and passing 
the cells through 70 um nylon mesh cell strainers (Falcon, 

10 Becton Dickinson, Lincoln Park, NJ) into 50ml centrifuge 
tubes (Falcon) . After centrif ugation, red blood cells were 
lysed with ACK Lysing Buffer (Biofluids, Inc., Rockville, 
MD) and the splenocytes washed twice with RPMI . In vitro 
stimulation cultures contained 3 X 10 6 splenocytes/ef f ectors 

15 per ml with 6 X 10 5 mitomycin-C-treated Renca/stimulator 
cells per ml and 10 Units per ml recombinant murine IL-2 
(Genzyme, Cambridge, MA) in RPMI containing 10% FBS, 22mM 
HEPES buffer (Research Organics Inc., Cleveland OH), Penn- 
Streptomycin, 5 X 10" 5 M 2-p-mercapto-ethanol (Life 

20 Technologies), OPI media supplement (Sigma, St. Louis, MO), 
and essential and non-essential amino acids (Life 
Technologies) (for a 5 : 1 responder : stimulator ratio) . 
Stimulators were prepared by incubating Renca cells at 3 X 
10 7 per ml in RPMI with 30 ug per ml mitomycin-C (Sigma) at 

25 37° C for 60 minutes, followed by four washes in HBSS with 
2.5% FBS . After 5 days at 37° C, effector cells were 
pelleted, resuspended in complete RPMI, counted, and mixed 
with 51 Cr -labeled targets in a 96 well V-bottomed plate 
(Costar/Corning, Cambridge, MA) . Renca and CT26 targets 

30 were labeled by incubating them at 2 X 10 6 cells per ml in 
complete RPMI with 150 uCi 51 Cr (Amersham) for 2.5 hours. 
Targets were washed 3 times in HBSS with 2.5% FBS and 
resuspended in complete RPMI before addition to the assay. 
After mixing effectors and targets (in triplicate wells) and 

35 a brief pelleting, plates were placed at 37° C for 6 hours. 
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Approximately 90% of the supernatants were then collected 
from each well with the Skatron Harvesting Press and 
Supernatant Collection System (Skatron Instruments, Norway). 

Cr release was detected using a WALLAC 1470 Wizard 
automatic gamma counter (WALLAC Inc., Gaithersburg MD) 
Specific release was determined with the following equation: 
(experimental cpm - spontaneous cpm) / (total cpm- 
spontaneous cpm) x 100. Spontaneous release from the 
targets was less than 18%, and the standard error of the 
triplicate experimental counts was less than 14%. 

Statistical analysis 

Data for the effects of mIFN-a gene therapy on tumor 
growth were analyzed by repeated measures analysis. 
Individual treatment means were compared using Duncan's 
multiple range test when the main effect was significant 
Data for the effect of mIFN-a gene therapy on tumor 
rejection were analyzed by ANOVA. In all cases a p value of 
less than 0.05 was considered to be statistically 
significant . 

20 Results 

Expression of mIFN-a 

Murine I FN -a expression plasmid (pIF0836) was 
transfected into Cos-1 cells, and the resulting conditioned 
media was assayed for mIFN-a by Western blot and by 
25 bioassay. Western blot analysis of conditioned media 
indicated that the recombinant mIFN-a expressed from pIF0836 
template was present as a single band with an approximate 
molecular weight of 23 kDa . This band was not observed in 
conditioned media from mock-transf ected cells and likely 
30 represents a glycosylated form of mIFN-o. Recombinant mlFN- 
oc ran with an approximate molecular weight of 18 kDa, which 
corresponds to the predicted molecular weight of non- 
glycosylated mIFN-a. Assay of conditioned media using an 
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anti-viral bioassay for mIFN-a indicated that approximately 
175 x 10 3 IU/ml mIFN-a were present. 

Anti-tumor activity of I FN -a gene therapy. The anti- 
tumor effect of murine IFNa4 plasmid formulated as a complex 
5 with PVP (IFNot/PVP) was tested in a syngeneic murine renal 
cell carcinoma (Renca) and a mammary adenocarcinoma (TS/A) 
tumor model. BALB/c mice were challenged subcutaneously 
with 7 X10 5 Renca or 1 X10 5 CT26 cells, and IFNa/PVP 
injections were initiated 7 days later when tumors reached 

10 approximately 10 mm 3 size. Each group of mice received at 
interval of 1-2 days 8 treatments (4 injections/week) of 
IFNa/PVP at scalar doses (from 12 to 96 jig /mouse ) , EP/PVP 
(96 ug /mouse) or no treatments for control (Ctrl) . Tumor 
size increased progressively in mice injected with EP/PVP 

15 (Renca, TS/A) or low doses of IFNa/PVP (TS/A), while tumors 
in mice injected with each dose of IFNa/PVP (Renca) or high 
dose of IFNa/PVP (TS/A) showed marked growth inhibition. 

Tumor growth inhibition is associated to systemic immune 
response 

20 Treatments of Renca and TS/A tumors with IFNa/PVP at 96 

ug/mouse and 48 jig/mouse respectively, induced complete 
regression in 6 out of 20 (Renca) and 2 out of 20 (TS/A) of 
challenged mice. To test whether the rejection of these 
tumors leads to specific immune memory, mice with no tumors 

25 for 40-50 days following IFNa treatments were re-challenged 
with a greater number of fresh tumors in the right flank. 
All mice that rejected primary tumors displayed protection 
against the second tumor challenge whereas mice used as the 
control group and injected for the first time with same 

30 number of tumor cells (1 X 10 6 Renca or 2 X 10 s TS/A) 
developed tumors . 

To evaluate the requirements for the induction of anti- 
tumor immune memory, Renca and TS/A were injected into 
BALB/c rendered immunosuppressed by treatment with anti-CD4, 

35 anti-CD8 or anti-polymorphonuclear cells { PMN) . Depletion 
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of CD8 + T cells allowed both Renca and TS/A to grow in all 
ammals following IFNa/PVP treatments, showing that this 
population is crucial for the immune response induced by 
IFNct gene therapy. No increase in tumor growth was found in 
mice treated with anti-PMN (cc-GRl ) monoclonal Ab (rtb , 
Increase in tumor rejection was observed in mice depleted of 
CD4 T and treated with IFNa/PVP suggesting that depletion 
of CD4 T cells can enhance the anti-tumor effect of i FNa 
gene therapy. 

Expression of IFN-oc within the tumor induces a CTL 
response. To assess whether CDS* tumor specific CTL were 
mduced in vivo by IFNa/PVP treatments; splenocytes from 
Renca tumor-challenged mice were tested for their cytolytic 
activity following IFNa gene therapy. Cytotoxic activity 
against Renca, and not against CT26 cells used as control 
for tumor specificity, was found in 2 of 4 mice that had 
received IFNa gene therapy. Moreover, splenocytes from mice 
depleted of CD4* T cells and treated with IFNa/PVP 
demonstrated potent CTL activity against Renca cells. By 
20 contrast, little CTL activity against Renca cells was 
evident from splenocytes isolated from mice treated with 
EP/PVP. 


15 


Discussion 

The data reported herein demonstrate that direct 
25 administration of IFNa gene formulated in a polymeric 
delivery system into subcutaneous renal cell carcinoma and 
mammary adenocarcinoma inhibits tumor growth and induces a 
long-lasting immunity to secondary tumor challenges of 
considerable significance is the fact that the anti-tumor 
response was observed against both an immunogenic carcinoma 
as well a more aggressive and poorly immunogenic 
adenocarcinoma, a phenotype which is similar to many 
spontaneously arising tumors in man. 

A variety of genetic abnormalities arise in human 
cancers that contribute to neoplastic transformation and 
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malignancy. Despite increasing understanding of the 

molecular basis of cancer, many malignancies remain 
resistant to established forms of treatment. More recently, 
molecular genetic interventions have been designed in an 
5 attempt to improve the efficacy of immunotherapy. While 
numerous experimental studies have been performed in murine 
models with tumor cells transduced with cytokine-gene ex 
vivo, a major limitation in the translation of this strategy 
to large-scale human tumor vaccine therapy is the labor 
10 intensity and variability of establishing each individual 
tumor in' culture and transducing it with an appropriate 
vector (i.e., retrovirus). Our approach -to this problem is 
to use a non-viral delivery system to modify tumor cells in 
vivo with cytokine cDNAs so that the tumor cells can supply 
15 the cytokine of interest in a paracrine fashion to the anti- 
tumor responder cells present within the tumor. 

Using a plasmid containing IFNa4 gene and formulated in 
PVP, we have shown that intra-tumoral injections of this 
DNA-PINC complex can lead to complete tumor regression in 30 
20 % of the cases (Renca model) with an overall response rate 
of 100 % tumor growth inhibition. These results are in 
agreement with a recent study that described an anti-tumor 
activity elicited by genetically modified TS/A cells 
producing murine IFN-al (Ferrantini et al . , 1994). Although 
25 the anti-tumor effect of I FNa using cytokine-gene transduced 
tumor cells has been described (Scarpa et al . , 
-Extracellular Matrix Remodelling in a Murine Mamary 
Adenocarcicoma Transfected with the Interf eron-alphal Gene", 
Journal of Pathology. 181, 116-123 1997), the real value of 
30 I FNa gene therapy in blocking or inhibiting advanced tumors 
remains to be explored. The advantage of using a non-viral 
I FNa gene delivery system over retrovirus is that tumor 
cells could be transduced directly in vivo without the need 
of first establishing tumor cells in vitro. Moreover, I FNa 
35 has a potent anti-viral activity limiting the use of this 
gene in combination with viral vectors. 
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CD4 + T cells does not deprive the tumor of growth factors 
(Rackmilevich et al., 1994). A possible explanation for 
this phenomenon is that depletion of CD4 + T cells from tumor- 
bearing mice augments the anti-tumor efficacy of IFNa- 
5 activated CD8 + T cells by releasing them from 
immunosuppression. The mechanism driving CD4 suppression is 
poorly understood, although Th2 type cytokines, directly or 
through B cell activation, may inhibit cell-mediated 
immunity (Mossman et al., 1989; Powrie et al . , Eur-J- 

10 Immunol , 23 ( 11 ): 3043-9 (1993)). CTL can be generated in 
both CD4-depleted and non-depleted mice from lymphocytes 
obtained from spleens by in vitro re-stimulation with 
mitomycin-treated Renca cells and rIL-2. Thus, CD4-mediated 
suppression appears to be exerted on CD8 expansion and not 

15 priming. In accord with the in vivo results, stronger CTL 
activity was observed from mice depleted of CD4 and treated 
with IFNa/PVP suggesting CD4 + T cells inhibit an IFNct- 
mediated CD8 + T cell response in vivo. This study suggests 
that direct administration of cytokine genes, like IFNct, 

20 into tumors, which have been found to suppress malignancy 
growth, provide a new therapeutic option for the treatment 
of human cancers. 

Example 2: Pharmacology of ml FN - Gene Medicine in Syngeneic 
Tumor Models 

25 Gene expression systems encoding either mIFN-2 or mlFN- 

4 were tested for anti tumor activity when formulated in 
either cationic lipid, peptide, or PINC delivery systems and 
injected intratumorally into subcutaneous squamous cell 
carcinoma (SCC-VII) or adenocarcinoma (MC-38) tumors. 

30 Experimental design and treatment regimen 

Experiments to test the anti tumor activity of mlFN- 
gene medicine were conducted in either SCC-VII or MC-38 
tumor models. Tumor cells (4 x 10 s ) were injected 

subcutaneously into the flank region of mice, and treatment 
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there is no effect of mlFN- gene medicine when formulated 
in PVA. In addition, analysis of tumors from this 

experiment using immunohistochemical methods revealed 
infiltration of CD8 + lymphocytes in tumors injected with 
5 cationic lipid formulations, but not in tumors injected with 
PVA formulations. 

mlFN- gene medicine significantly reduces the growth of 
SCC-VII tumors as compared to control- plasmid or lactose 
injected tumors. Differences between control plasmid and 

10 ml FN- plasmid were consistent across formulation. Plasmid 
dose was 4 6 pg/treatment . Growth of tumors injected with 
control plasmid was compared to that of tumors injected with 
ml FN- gene medicine using repeated measures analysis, 
ml FN- reduced SCC-VII tumor growth relative to control 

15 plasmid (p=.035). 

Example 4 : ml FN- Gene Medicine Reduces the Growth of MC-38 
Tumors 

Experiments were carried out as described in the 
preceding examples. The effects of various prototype 

20 formulations of mlFN- gene medicine on the growth of 
subcutaneous MC-38 tumors were compared. ml FN- gene 

medicine elicited reduced tumor growth in all formulations 
tested (cationic lipid, peptide, and PINC) . Subsequent 
experiments in the MC-38 tumor model have shown similar 

25 results. 

Example 5: Dose Responses 

After demonstrating anti tumor effects of ml FN- gene 
medicine, the dose response for these effects was 
investigated in the MC-38 tumor model. Both cationic lipid 
30 (D0TMA:Chol) and PINC (PVA) delivery systems were evaluated. 
Results clearly show that ml FN- gene medicine elicited a 
dose dependent reduction in tumor growth. Tumor volume in 
this experiment was maximally reduced by approximately 50 % 
with ml FN- /DOTMA:Chol and 60 % with ml FN- /PVA after 4 
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treatments. Maximal reduction in tumor volume was observed 
at a plasmid dose of approximately 50 ug/treatment 
(cumulative dose of approximately 200 pg) . These 
experiments will be conducted primarily in the MC-38 tumor 
5 model because it provides a broader treatment window than 
does the SCC-VII model. 

Example 6: IFN-alpha Formulations 

The formulations for the IFN-a are: (1) PVP 4 vial, (2) 

PVP three vial, (3) PVP two vial. The details are listed 
10 below: 

PVP 4 vial 

Materials: 25% PVP (50 JcDa) stock solution, plasmid 
stock solution, 5 M NaCl stock solution, and water. 

Method: Add in order of water, plasmid, 25% PVP and 5 M 
15 NaCl into a vial to make a plasmid in 5% PVP in saline 
formulation. The final concentration of PVP and NaCl are 
fixed (5% and 150 mM) and plasmid concentration could be 
varied (but based on the IGF-1 data, 3 mg DNA/ml in 5% PVP 
in saline should be the best formulation) . The quality of 
20 the formulation is characterized by pH, DNA concentration, 
osmolality, and gel electrophoresis. The DNA concentration 
could be varied from 0.1-5 mg/ml. The pH may be varied from 
3-5, osmolality may be 250 - 400 mOsm. 

Three vial 

25 Material: lyophilized PVP, plasmid stock solution (4 

mg/ml), 115 mM Na-Citrate/5% NaCl stock buffer (pH = 4). 

Method: Add in order of plasmid and buffer into the PVP 
to make final 3 mg DNA/ml in 5% PVP in 25 mM citrate/saline 
buffer (pH =4). DNA expression is higher in saline than in 

30 the citrate buffer. 
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Two vial 

Materials: Co-lyophilized plasmid and PVP, saline. Add 
saline into the co-lyophilized DNA and PVP to make final 3 
mg/ml DNA in 5% PVP in saline. 

The final formulation is 3mg/mL DNA, 5% PVP as a single 
vial. The formulation is prepared by adding (5%) PVP to 
(4mg/mL) DNA and recirculating the two components for a 
finite period of time (using static mixer) . Then the 
formulation is lyophilized and rehydrated with 0.9% sodium 
chloride, to obtain a final composition of 3mg/mL, 5%PVP in 
saline. 

Example 7: Treatment of Human Tumors 

The murine studies are predictive of the response of 
Human tumors to therapy using a plasmid construct encoding 
the human I FN alpha gene sequence such as that depicted in 
SEQ ID NO: 10, 11 or 12. A patient in need of anti-cancer 
therapy is injected with up to 3mg of plasmid formulation at 
daily intervals. The plasmid formulation may contain INF 
alpha plasmid alone or optionally a mixture of IFN-alpha 
encoding plasmid and an additional plasmid coding for .a 
cytokine. The preffered cytokine is IL-12. The treatments 
are continued and the patient monitored as is the usual 
practice for anti-cancer chemotherapeutic regimes. 

25 one skilled in the art would readily appreciate that 

the present invention is well adapted to carry out the 
objects and obtain the ends and advantages mentioned, as 
well as those inherent therein. The molecular complexes and 
the methods, procedures, treatments, molecules, specific 
compounds described herein are presently representative of 
preferred embodiments are exemplary and are not intended as 
limitations on the scope of the invention. Changes therein 
and other uses will occur to those skilled in the art which 
are encompassed within the spirit of the invention are 
35 defined by the scope of the claims. 
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It will be readily apparent to one skilled in the art 
that varying substitutions and modifications may be made to 
the invention disclosed herein without departing from the 
scope and spirit of the invention. 
5 All patents and publications mentioned in the speci- 

fication are indicative of the levels of those skilled in 
the art to which the invention pertains. All patents and 
publications are herein incorporated by reference to the 
same extent as if each individual publication was specific- 
10 ally and individually indicated to be incorporated by 
reference . 

The invention illustratively described herein suitably 
may be practiced in the absence of any element or elements, 
limitation or limitations which is not specifically 

15 disclosed herein. Thus, for example, in each instance 
herein any of the terms "comprising", "consisting 
essentially of" and "consisting of" may be replaced with 
either of the other two terms. The terms and expressions 
which have been employed are used as terms of description 

20 and not of limitation, and there is no intention that in the 
use of such terms and expressions of excluding any 
equivalents of the features shown and described or portions 
thereof, but it is recognized that various modifications are 
possible within the scope of the invention claimed. Thus, 

25 it should be understood that although the present invention 
has been specifically disclosed by preferred embodiments and 
optional features, modification and variation of the 
concepts herein disclosed may be resorted to by those 
skilled in the art, and that such modifications and 

30 variations are considered to be within the scope of this 
invention as defined by the appended claims. 

In addition, where features or aspects of the invention 
are described in terms of Markush groups, those skilled in 
the art will recognize that the invention is also thereby 

35 described in terms of any individual member or subgroup of 
members of the Markush group. For example, if X is 
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described as selected from the group consisting of bromine, 
chlorine, and iodine, claims for X being bromine and claims 
for X being bromine and chlorine are fully described. 

Other embodiments are within the following claims. 
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Claims 


1- A plasmid comprising a CMV promoter 
transcriptionally linked with an interferon ^J™' 
sequence, and a growth hormone 3 • -untranslated region. ' 


10 


2 The plasmid of claim 1, wherein said interferon 
alpha is human interferon alpha. nterferon 

3- The plasmid of claim 2, wherein v, 
having optimal codon usage. - equence 

aloh/' w The PlaSmid ° f Claim 3 ' WhSrein said interferon 

Xo, t:: ;r ence has the nucieotide — - «° - 

^e 3^^'^ £ ^ 
15 hormone gene. hUman gr ° wtn 

ALU T, P , laSmid ° f Claim 5 ' Wh6rein an ALU «*«t or 

ALU repeat-like sequence is deleted from said 3- 
untranslated region. 

?n ■ , I' PlaSmid of clai ™ 1. wherein said plasmid 

Inton J Pr0m ° ter ' 3 ™ A b ° X ' 3 — a " d « «»t 

-tron and mtron/exon boundary in appropriate relationship 
for expression of said coding sequence. ' 

further co^- PlaSmld ° f Claim 7 ' Wh6rein Said Pl«-ld 
further comprises a 5' mRNA leader sequence inserted between 
25 said promoter and said coding sequence between 
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9. The plasmid of claim 1, wherein said plasmid 
further comprises an intron/5' UTR from a chicken skeletal 
a-actin gene. 

10. The plasmid of claim 1, wherein said plasmid 
5 comprises a nucleotide sequence which is the same as the 

nucleotide sequence of plasmid pIF0921. 

11. The plasmid of claim 1, further comprising: 

a first transcription unit comprising a first 
transcriptional control sequence transcriptionally linked 

10 with a first 5 ' -untranslated region, a first intron, a first 
coding sequence, and a first 3 ' -untranslated region/poly (A) 
signal, wherein said first intron is between said control 
sequence and said first coding sequence; and 

a second transcription unit comprising a second 

15 transcriptional control sequence transcriptionally linked 
with a second 5 ' -untranslated region, a second intron, a 
second coding sequence, and a second 3 ' -untranslated 
region/poly (A) signal, wherein said second intron is between 
said control sequence and said second coding sequence; 

20 wherein said first and second coding sequences 

comprise a sequence having the sequence of SEQ ID NO: 2, 3, 4 
or 25 coding for human IL-12 p40 subunit, and a sequence 
having the sequence of SEQ ID NO: 6, 7, 8 or 24 coding for 
human IL-12 p35 subunit. 

25 12. The plasmid of claim 11, wherein said first 

transcriptional control sequence or said second 
transcriptional control sequence comprise one or more 
cytomegalovirus promoter sequences. 

13. The plasmid of claim 11, wherein said first and 
30 second transcriptional control sequences are the same. 
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14. 


The plasmid of claim li, wherein said first and 
second transcriptional control sequences are different 

15 The plasmid of claim 14, wherein said sequence 
coding for the p 40 subunit of human IL-12 is 5 - tc 7 h 
5 sequence coding for the P 35 subunit of human IL-12. 

16. The plasmid of claim 1, further comprisina an 

intron having variable splicing a fir.,- „ „ • 

= „ . , opj.xt.ing, a fxrst coding sequence 

and a second coding sequence, quence, 

0 com • WherSin Sald firSt and SSCOnd cod ing sequences 
comprise a sequence having the sequence of SEQ 

or 25 coding for human IL-12 o 40 subnni,- ,, 

having the sequence of SEQ 10 NO 6 S * Z^'™ 

human IL-12 p35 subunit. f ° r 


17. 


The plasmid of claim 16, further comprising: 

transcriptional control sequence 

transcriptionally linked with a first mH ■ sequence 
seconri first coding sequence and a 

second coding sequence ; 

a 5' -untranslated region; 

an intron 5- to said first coding sequence; 
an alternative splice site 3- to said first codina 
sequence and 5- to said second coding sequence; and * 
a 3 '-untranslated region/poly (A) signal. 

18 The plasmid of claim 17, „ herei n said 
transcriptional control sequence comprises a cytomegalovirus 
promoter sequence. ^ us 

19- The plasmid of claim 1, further comprising: - 

transcriptional control sequence 

transcriptionally li nked witn a fl sequence 
TDpe _ rirst coding sequence, an 

IRES sequence, a second coding sequence, and a 3-- 
untranslated region/poly (A) sianal \ 

^ Yl "' signal, wherein said IRES 
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sequence is between said first coding sequence and said 
second coding sequence; and 

an intron between said promoter and said first 
coding sequence; 

5 wherein said first and second coding sequences 

comprise a sequence having the sequence of SEQ ID NO: 2, 3, 4 
or 25 coding for human IL-12 p40 subunit, and a sequence 
having the sequence of SEQ ID NO: 6, 7, 8 or 24 coding for 
human IL-12 p35 subunit. 

10 20. The plasmid of claim 19, wherein said 

transcriptional control sequence comprises, a cytomegalovirus 
promoter sequence . 

21. The plasmid of claim 19, wherein said IRES 
sequence is from an encephalomyocarditis virus. 

15 22. A composition comprising the plasmid of anyone of 

claims 1-21, and a protective, interactive non-condensing 
compound. 

23. The composition of claim 22, wherein said 
protective, interactive non-condensing compound is polyvinyl 

20 pyrrolidone. 

24. The composition of claim 22, wherein said plasmid 
is in a solution having between 0.5% and 50% PVP. 

25. The composition of claim 24, wherein said solution 
includes about 5% PVP. 

25 26. The composition of claim 22, wherein said DNA is 

at least about 80% supercoiled. 

27. The composition of claim 26, wherein said DNA is 
at least about 90% supercoiled. 
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28. The composition of claim 27, wherein said DNA is 
at least about 95% supercoiled. 

29. A composition comprising a protective, interactive 
non-condensing compound and a plasmid comprising an 

5 interferon alpha coding sequence. 

30. A composition comprising the plasmid of any one of 
claims 1-21 and a cationic lipid with a neutral co- 
lipid. 

, ft ThS con, P° si tion of claim 30, wherein said cationic 

10 lipid is DOTMA. 

32. The composition of claim 30, wherein said neutral 
co-lipid is cholesterol. 

33. The composition of claim 30, wherein the DNA in 
said plasmid and said cationic lipid are present in such 

15 amounts that the negative to positive charge ratio is about 

34. The composition of claim 30, wherein said DNA is 
at least about 80% supercoiled. 

35. The composition of claim 34, wherein said DNA is 
20 at least about 90% supercoiled. 

36. The composition of claim 35, wherein said DNA is 
at least about 95% supercoiled. 

37. The composition of claim 30, further comprising an 
isotonic carbohydrate solution. 
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38. The composition of claim 37, wherein said isotonic 
carbohydrate solution consists essentially of about 10% 
lactose. 

39. The composition of claim 30 wherein said cationic 
5 lipid and said neutral co-lipid are prepared as a liposome 

having an extrusion size of about 800 nanometers. 

40. A composition comprising: 

a first component comprising a plasmid comprising 
an interferon alpha coding sequence and a cationic lipid 
10 with a neutral co-lipid, wherein said cationic lipid is 
DOT MA and said neutral co-lipid is cholesterol, wherein the 
DNA in said plasmid and said cationic lipid are present in 
amounts such that the negative to positive charge ratio is 
about 1:3; and 

15 a second component comprising a protective, 

interactive non-condensing compound, wherein said first 
component is present within the second component. 

41. A composition comprising a protective, interactive 
non-condensing compound, a first plasmid comprising an 

20 interferon alpha coding sequence, and one or more other 
plasmids independently comprising an IL-12 or IL-12 subunit 
coding sequence. 

42. A composition comprising a plasmid comprising an 
interferon alpha coding sequence and a cationic lipid with a 

25 neutral co-lipid. 

43. A method for making a plasmid of anyone of claims 
1-21 comprising the step of inserting a CMV promoter 
transcriptionally linked with an interferon alpha coding 
sequence, and a growth hormone 3 ' -untranslated region into a 

30 plasmid. 
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44. A method for making a composition of claim 29, 
comprising the steps of: 

a. preparing a DNA molecule comprising a 
transcriptional unit, wherein said transcriptional unit 
comprises an interferon alpha coding sequence; 

b. preparing a protective, interactive non- 
condensing compound; and 

c. combining said protective, interactive non- 
condensing compound with said DNA in conditions such that a 
composition capable of delivering a therapeutically 
effective amount of an interferon alpha coding sequence to a 
mammal is formed. 


is 


45. The method of claim 44 wherein said DNA molecule 
a plasmid, wherein said plasmid comprises a CMV promoter 

transcriptionally linked with an interferon alpha coding 
sequence, and a human growth hormone 3 * -untranslated 
region/poly (A) signal . 

46. A method of making a composition of claim 30, 
comprising the steps of: 

a. preparing a DNA comprising an interferon 
alpha coding sequence; 

b. preparing a mixture of a cationic lipid and a 
neutral co-lipid, wherein said cationic lipid is DOT MA and 
said neutral co-lipid is cholesterol; and 

c. combining said mixture with said DNA in 
amounts such that said cationic lipid and said DNA are 
present in a negative to positive charge ratio of about 1:3. 

47. A method of making a composition of claim 40, 
comprising the steps of: 

a. preparing a first component comprising a 
plasmid comprising an interferon alpha coding sequence and a 
cationic lipid with a neutral co-lipid, wherein said 
cationic lipid is DOTMA and said neutral co-lipid is 
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cholesterol, wherein the DNA in said plasmid and said 
cationic lipid are present in amounts such that the negative 
to positive charge ratio is about 1:3; 

b. preparing a second component comprising a 
protective, interactive non-condensing compound; and 

c. combining said first and second components 
such that the resulting composition comprises said first 
component within said second component. 

48. A method of making a composition of claim 41, 
comprising the steps of: 

a. preparing a protective, * interactive non- 
condensing compound, 

b. preparing a first plasmid comprising an 
interferon alpha coding sequence, 

c. preparing one or more other plasmids 
independently comprising an IL-12 p35 or IL-12 p40 subunit 
coding sequence, and 

d. combining said protective, interactive non- 
condensing compound, said plasmid comprising said interferon 
alpha coding sequence and said other plasmids. 

49. A method of making a composition of claim 42 
comprising combing a plasmid comprising a interf eronalpha 
coding sequence and a cationic lipid with a neutral co- 
lipid. 

50. A method for treatment of a mammalian condition or 
disease, comprising administering to a mammal suffering from 
said condition or disease a therapeutically effective amount 
of a plasmid of anyone of claims 1-21. 

51. The method of claim 50, wherein said condition or 
disease is a cancer . 
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521 The method of claim 50, wherein said composition 
is administered by injection. 

53. A method for transfection of a cell in situ 

comprising the step of contacting said cell with a plasmid 

5 of anyone of claims 1-21 for sufficient time to transfect 
said cell. 

54. The method of claim 53, wherein transfection of 
said cell is performed in vivo. 


10 


55. The method of claim 53, wherein* said contacting is 
performed in the presence of an about 5% PVP solution. 

56. A method for delivery and expression of an 
interferon alpha gene in a plurality of cells, comprising 
the steps of: 

(a) transfecting said plurality of cells with a 
15 plasmid of anyone of claims 1-21; and 

(b) incubating said plurality of cells under 
conditions allowing expression of a nucleic acid sequence in 
said vector, wherein said nucleic acid sequence encodes 
interferon alpha. 

20 57 * The ^thod of claim 56, wherein said interferon 

alpha is human interferon alpha and said cells are human 
cells . 

58. The method of claim 56, wherein said contacting is 
performed in the presence of an about 5% PVP solution. 

25 59 - A me thod for treating a disease or condition, 

comprising the steps of transfecting a cell in situ with a 

plasmid of any one of claims 1-21. 
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60. The method of claim 59, wherein said disease or 
condition is a localized disease or condition. 

61. The method of claim 59, wherein said disease of 
condition is a systemic disease or condition. 

62. A cell transfected with a plasmid of anyone of 
claims 1-21. 


63. A method for treatment of a mammalian condition or 
disease, comprising administering to a mammal suffering from 
said condition or disease a therapeutically effective amount 

10 of a composition of claim 22. 

64. A method for treatment of a mammalian condition or 
disease, comprising administering to a mammal suffering from 
said condition or disease a therapeutically effective amount 
of a composition of claim 29. 

15 65. A method for treatment of a mammalian condition or 

disease, comprising administering to a mammal suffering from 
said condition or disease a therapeutically effective amount 
of a composition of claim 30. 


66. A method for treatment of a mammalian condition or 
20 disease, comprising administering to a mammal suffering from 

said condition or disease a therapeutically effective amount 
of a composition of claim 40. 

67. A method for treatment of a mammalian condition or 
disease, comprising administering to a mammal suffering from 

25 said condition or disease a therapeutically effective amount 
of a composition of claim 41. 

68. A method for treatment of a mammalian condition or 
disease, comprising administering to a mammal suffering from 
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said condition or disease a therapeutically effective amount 
of a composition of claim 42. 


69. A method for treatment of a mammalian condition or 
disease, comprising administering to a mammal suffering from 
said condition or disease a therapeutically effective amount 
of a composition of a first plasmid comprising an interferon 
alpha coding sequence and a second plasmid comprising a IL- 
12 coding sequence. 
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EIGURR 1 mr]Vf H_[? 
PLASMTD P flM|m 


:cctcgccgtcgggc 


ttccatccgagtacgt 
gc.cgc.cga,gcgatg«cgcngpgg,cga^^ 


- - _ ^ctgcg 
caaggaacgcccpcgtggccagccacga^^ 


gaacctgcgtgcaatcca.cngttcaa.ca.gcgaaacgatcctcatcctgtc.ct.gatcaga.^^^ 
gcca.ecagtnacmgcagggcttcccaaccttaccagagggcgaancgagcttgcatgcetgcagg.cgtta^^^ 
gc.gaccgcccaacgacccccgcccattgacgtcaataa.gacgutgttccca.agtaacgccaaugggacm^^^ 


cctggcattatgcccag 
ictttccaaaatgtcgtaacaactcc 


gctgtmgacctccatagaagacaecgggaccgatccagcctccgcggccgggaacggtgcanggaacgcgganccccgtg^ 
««««««mecti«cc^^ 

gcc.ccatggpccagcgcgcagccecctccttgtggcuccctggtcctcctggaccacctcagtttggccag^ 
aggaatgttcccatgccncaccactcccaaaa^ 

8-g"8*«g«ca.gaagamcacaaaaga,aaaa=^ 
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gagacctcmcataactaatgggagttgcctggcctccagaaagacctcttttatgatggccctgtgccrtagtagtatttatgaagacttgaagatgtaccagg 
tggagttcaagaccatgaatgcaaagcttctgatggatcctaagaggcagatctttctagatcaaaacatgctggcagttattgatgagctgatgcaggccctg 
aamcaacagtgagactgtgccacaaaaatcctcccttgaagaaccggatttttataaaactaaaatcaagctctgcatacttcttcatgcmcaga^ 
cagtgactattgatagagtgatgagctatctgaatgcttcctaacaattcte^ 

tgcctctcctggccctggaagrtgccactccagtgcccaccagccttgtcctaataaaattaagttgcatcattttgtctgactaggtgtccttctataatattatgg 

ggtggaggggggtggtatggagcaaggggcaagttgggaagacaacctgtagggctcgagggggggcccggtacggtcgttacataacttacggtaaa 

tggcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaaUgggactttccattgacgtcaatg 

ggtggagtatttacggtaaactgcccacttggcagtacatcaagtgtatcate^^ 

catutgcccagtacatgaccttatgggacmcctacttggcagtacatctacgj^^^ 

gcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaaatcaacgggactttccaaaatgtc 

gtaacaactccgccccattgacgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagctcgtttagtgaaccgtcagatcgcctggaga 

cgccatccacgctgttttgacctccatagaagacaccgggaccgatccagcctccgcggccgggaacggtgcattggaacgcggartccccgtgttaarta 

acaggtaagtgtcncctcctgmccttcccctgctattctgctcaaccttcctatcagaaactgcagtatctgtattmgctagaattgtactaacggncttt^ 

tcncacaggcttaagtcatgggtcaccagcagttggtcatctcttggttttccctggtttttctggcatctcccctcgtggccatatgggaactgaagaaagatgt 

ttatgtcgtagaattgganggtatccggatgcccctggagaaatggtggtcctcacctgtgacacccctgaagaagatggtatcacctggaccttggaccag 

agcagtgaggtcttaggctctggcaaaaccctgaccatccaagtcaaagagmggagatgctggccagtacacctgtcacaaaggaggcgaggttctaag 

ccattcgctcctgctgctteacaaaaaggaagatggaam^ 

gccaagaattattctggacgtttcacctgctggtggctgacgacaatcagtactgatttgacattcagtgtcaaaagcagcagaggctcttctgacccccaag 
gggtgacgtgcggagctgctacactctctgcagagagagtcagaggggacaacaaggagtatgagtactcagtggag^gccaggaggacagtgcctgc 
ccagctgctgaggagagtctgcccattgaggtcatgg^ggatgccgttcacaagctcaagtatgaaaactacaccagcagcttcttcatcagggacatcatc 
aaacctgacccacccaagaacttgcagctgaagccattaaagaattctcggcaggtggaggtcagctgggagtaccctgacacctggagtactccacattc 
ctacttctccctgacattctgcgttcaggtccagggcaagagcaagagagaaaagaaagatagagtcttcacggacaagacctcagccacggtcatctgcc 
gcaaaaatgccagcattagcgtgcgggcccaggaccgctactatagctcatcttggagcgaatgggcatctgtgccctgcagttagacgcgctagaaaag 
ccgaattctgcaggaattgggtggcatccctgtgacccctccccagtgcctctcctggccctggaagttgccactccagtgcccaccagccttgtcctaataa 

aattaagttgcatcatmgtctgactaggtgjccttctataa^ 
ggctcgagggggggcccggtaccagctmgttccctttagtgagggfm^ 
gctcacaattccacacaacatacgagccggaagcataaagtgta^ 
cgcmccagtcggga^cctgtcgtgccagctgcattaatg^ 

cactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggataacgcagga 

aagaacatgtgagcaaaaggccagcaaaaggcw^ 
aaaaatcgacgctcaagtcagaggtggcgaaacccgacagga^^ 
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tcgctc 
;acacgacttatcgcc 

^ggtgctacagagttcttgaagtggtggcctaactacggctacactagaagg 

_ tggtagcggtggttttttt 
^B^8'«gcgcagaaaaaaagg^^ 

tagaaggcga.gcgc,gcgaatcgggagcggcgau,ccgtaaagcacgaggaagcggtcagccca„cgc=gccaagc,ctt=agca M 

gccaacgctatgtcctgatagcggtccgccacacccag 
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F IGUR E ? 


Codon Frequency 

human_htgh.doc 

AaAQV OOion HdftM* AOTtt Fr*e©fc*» 


Gly 

GGT 

441.00 

9.14 

0.12 

Gly 

GCC 

1867.00 

38.70 

0.50 

du 

GAG 

2430.00 

50.16 

0.75 

Ou 

GAA 

792.00 

16.42 

0.25 

Aip 

GAT 

592.00 

12.27 

0.23 

Atp 

GAC 

1121.00 

37.75 

0.75 

Vd 

GTG 

1166.00 

38.68 

0.64 

V»l 

GTA 

04.00 

2.7g 

0.05 

Vtl 

GTT 

191.00 

4.10 

0.01 

V»l 

GTC 

728.00 

15.09 

0.2S 

All 

GCG 

652.00 

13.51 

0.17 

AU 

CCA 

411.00 

10.12 

0.13 

AU 

GCT 

654.00 

13.56 

0.17 

Ala 

GCC 

2057.00 

42.64 

0.53 

Arg 

AGG 

112.00 

10.61 

0.18 

A/g 

AGA 

298.00 

6.18 

0.10 

Ser 

AGT 

354.00 

7J4 

0.10 

Sct 

AGC 

1171.00 

24 J7 

0.34 

Ly» 

AAC 

2117.00 

43.88 


Ly» 

AAA 

471.00 

9.76 

0.18 

Asa 

AAT 

114.00 

6.51 

0.22 

Am 

AAC 

1 120.00 

23 27 

0,78 

Met 

ATG 

1077.00 

22J2 

1.00 

De 

ATA 

81.00 

1.82 

0.05 

De 

ATT 

315.00 

6.53- 

0.18 

Q« 

ATC 

1369.00 

2SJ8 

0.17 

Ttar 

ACQ 

405 00 

8.40 

0.15 

Thr 

ACA 

373.00 

7.73 

0.14 

Thr 

ACT 

351.00 

7.42 

0.14 

Thr 

ACC 

1502.00 

31.13 

0.37 

Ttp 

TGG 

632.00 

13.51 

1.00 

End 

TGA 

109.00 

2.26 

0.55 

Cy» 

TGT 

125.00 

6.74 

0J2 

Cym 

TGC 

706.00 

14.63 

0.68 

End 

TAG 

42.00 

0.87 

0J1 

End 

TAA 

46.00 

0.95 

011 

Tyr 

TAT 

360.00 

7.46 

0.26 

Tyr 

TAC 

1042.00 

21.60 

0.74 
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PIF0836 



ittttggattccccttggagaaggtgg 

;aacctcttcacatcaaaagacttgtctgctacttggaate 

c ^*ctagac t cattctgc^^ „ f 

gactccctgct 

atcagagcagaagtctggagagcc 

4 - atccctgtgacccctcccca 

iacaacctgtagggctcgagggggggcccggtaccagcttttgttccctttagtga 

gaagcataaagtgt 
ctgcattaatgaat 
gctgcggcgagcggtat 
acatgtgagcaaaaggccagcaaaaggccaggaaccFt 

actggtaacaggattagcagagcgaggtat 
atttggtatctgcgctctgctgaagccagttaccttcggaa 
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aaagagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtiumglllgcaagcagcagattacgcgcagaaaaaaaggatctcaag 

aagatcctttgatcttttctacggggtctgacgctcagaagaactcgtcaagaaggcgatagaaggcgatgcgctgcgaatcgggagcggcgataccgtaa 

agcacgaggaagcggtcagcccattcgccgccaagctcttcagcaatatcacgggtagccaacgctatgtcctgatagcggtccgccacacccagccgg 

ccacagtcgatgaatccagaaaagcggccattttccaccatgatattcggcaagcaggcatcgccatgcgtcacgacgagatcctcgccgtcgggcatgc 

gcgccttgagcctggcgaacagttcggctggcgcgagcccctgatgctcttcgtccagatcatcctgatcgacaagaccggcttccatccgagtacgtgctc 

gctcgatgcgatgtttcgcttggtggtcgaatgggcaggtagccggatcaagcgtatgcagccgccgcattgcatcagccatgatggatactttctcggcag 

gagcaaggtgagatgacaggagatcctgccccggcacttcgcccaatagcagccagtcccttcccgcttcagtgacaacgtcgagcacagctgcgcaag 

gaacgcccgtcgtggccagccacgatagccgcgctgcctcgtcctgcagttcattcagggcaccggacaggtcggtcttgacaaaaagaaccgggcgcc 

cctgcgctgacagccggaacacggcggcatcagagcagccgattgtctgttgtgcccagtcatagccgaatagcctctccacccaagcggccggagaac 

ctgcgtgcaatccatcttgttcaatcatgcgaaacgatcctcatcctgtctcttgatcagatcttgatcccctgcgccatcagatccttggcggcaagaaagcca 

tccagtttacmgcagggcttcccaaccttaccagagggcgaattcgagcttgcatgcctgcaggtcgttacataacttacggtaaatggcccgcctggctga 

ccgcccaacgacccccgcccattgacgtcaataatgacgtatgtt<xcatagtaacgc^ 

aaactgcccacttggcagtacatcaagtglatcatatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattatgcccagtacat 
gaccttatgggacmcctacnggcagtacatctacgtattagtcatcgctattaccatggtgatgcggtmggcag^acatcaatgggcgtggatagcgg^ 
gactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaaatcaacgggactttccaaaatgtcgtaacaactccgcccc 
augacgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagctcgtttagtgaaccgtcagatcgcctggagacgccatccacgctgttt 
tgacctccatagaagacaccgggaccgatccagcctccggactctagctagag 
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FIGURE 5 MOUSE IL-12 

BOVINE GH 3' UTR 



agcttcgagggggggcccggtaccagctmgttcccmag^ 
g^aattccacacaacatacgagccggaa 

cgcmccagtcgggaaacctgtcgtgccagctgcattaatgaatcggccaacgcgcggggagaggcggmgcgtattgggcgctc^ 

cactgactcgctgcgctcggtcgttcggctgcggcgagcgg^tcagctcactcaaaggcggtaatacggttatccacagaatcagggga^^ 

aagaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctggcgttmccata 

aaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgmccccctggaagctccctc^ 

cctgccgcttaccggaucctgtccgccmctcccttcgggaagcg^ggcgcmctcatagctcacgctgtag^^ 

caagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgcctutccggtaactatc^^^ 


acactagaagg 

acagtamggtatctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcngatccggca^^ 
gmgcaagcagcagattacgcgcagaaaaa^^^ 


gccaacgcmgtcctgatagcggtccgccacacccagccggccacagtcgatgaatccagaaaagcggccatmccaccatgatattcggc^ 

catcgccatgcgtcacgacgagatcctcgccgtcgggcatgcgcgccttgagcctggcgaacagttcggctggcgcgagcc^ 

atcatcctgatcgacaagaccggcnccatccgag^cgtgctcgctcgatgcgatgmcgcrtggtggtcgaatgg 

cagccgccgcattgcatcagccatgatggatactttctcggcaggagcaaggtgagatgacaggagatcctgccccggcacttcgcccaatagcagccag 
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tcccttcccgcttcagtgacaacgtcgagcacagctgcgcaaggaacgcccgtcgtggccagccacgatagccgcgctgcctcgtcctgcagttcattcag 

ggcaccggacaggtcggtcttgacaaaaagaaccgggcgcccctgcgctgacagccggaacacggcggcatcagagcagccgattgtctgttgtgccc 

agtcatagccgaatagcctctccacccaagcggccggagaacctgcgtgcaatccatcttgttcaatcatgcgaaacgatcctcatcctgtctcttgatcagat 

cttgatcccctgcgccatcagatccttggcggcaagaaagccatccagtttactttgcagggcttcccaaccttaccagagggcgccccagctggcaattcc 

ggttcgcttgctgtccataaaaccgcccagtctagcaactgttgggaagggcggggctgcaggaattcgagcttgcatgcctgcaggtcgttacataactta 

cgg^aaatggwcgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatagggactttccattgac 

gtcaatgggtggagtatttacggtaaactgcccacttggcag^catcaagtgtatcatatgccaag^acgccccctattgacgtcaatgacggtaaatggcc 

cgcctggcattatgcccagtacatgaccttatgggacmcctacttggcag^catctocgtattogtcatcgctattaccatggtgatgcggttttggcagtaca 

tcaatgggcg^ggaugcggWgactcacggggamccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaaatcaacgggactttcca 

aaatgtcgmcaactccgccccattgacgcaaatgggcggtaggcgtgtacggtgggaggtcUtataagcagagctcgtttagtgaaccgtcagatcgc 

ctggagacgccatccacgctgttttgacctccatagaagacaccgggaccgatccagcctccgcggccgggaacggtgcattggaacgcggattccccgt 

gtiaattaacaggtaagtgtcttcctcctgtttccttcccctgctattctgctcaaccttcctatcagaaactgcagtatptgtattmgctagcagtaatact 

ttcttttmctcttcacaggccacgatgtg^caatcacgctacctcctcttmggccacccttgccctcctaaaccacctcagtttggccagggtc 

tggacctgccaggtgtcttagccagtcccgaaacctgctgaagaccacagatgacatggtgaagacggccagagaaaaactgaaacattattcctgcactg 

ctgaagacatcgatcatgaagacatcacacgggaccaaaccagcacattgaagacctgtttaccactggaactacacaagaacgagagttgcctggctact 

agagagacttcttccacaacaagagggagctgcctgcccccacagaagacgtctttgatgatgaccctgtgccttggtagcatctatgaggacttgaagatg 

taccagacagagttccaggccatcaacgcagcacttcagaatcacaaccatcagcagatcattctagacaagggcatgctggtggccatcgatgagctgat 

gcagtctctgaatcataatggcgagactctgcgccagaaacctcctgtgggagaagcagacccttacagagtgaaaatgaagctctgcatcctgcttcacg 

ccttcagcacccgcgtcgtgaccatcaacagggtgatgggctatctgagctccgccttaactagtgaaagctcaaggccctctgccacagcgccctcctcac 

acagataggaattctgcagatcctagagctcgctgatcagcctcgactgtgccttctagttgccagccatctgttgtttgcccctcccccgtgccttccttgacc 

ctggaaggtgccactcccactgtcctttcctaataaaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctggggggtggggtggggcagga 

cagcaagggggaggattgggaagacaatagcaggcatgctggggatgcggtgggctctatggcttctgaggcggaaagaaccagctggggctcgaaat 

tcgagcttgcatgcctgcaggtcg^cataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatg 

ttcccatagtaacgccaaugggacmccattgacgtcaatgggtggag^ttucggtaaactgcccacttggcagtacatcaagtgtatc 

cgccccctattgacgtc^tgacggtaaatggcccgcctggcattatgcccag^catgaccttatgggacmcctacttggcagtacatctacgtattagtca 

tcgctottaccatggtgatgcggtmggcagtacatcaatgggcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatggg 

agtttg^ggcaccaaaatcaacgggacmccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggtaggcgtgtacggtgg 

taagcagagctcgtttagtgaaccgtcagatcgcctggagacgccatccacgctgttttgacctccatagaagacaccgggaccgatccagcctccgcggc 

cgggaacggtgcattggaacgcggattccccgtgttaatmcaggtaagtgtcttcctcctgtttccttcccctgctattctgctcaaccttcctatcagaaact 

gcagtatctgUttmgctagcagtaaUctaacgg^cttttmctcttcacaggccacgatg^ 

ggtgtctccactcatggccatg^gggagctggagaaagacgtttatgttgtagaggtggactggactcccgatgcccctggagaaacagtgaacctcacct 
gtgacacgcctgaagaagatgacatcacctggacctcagaccagagacatggagtcataggctctggaaagaccctgaccatcactgtcaaagagtttcta 
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aaaaamcaaaaacaagacmcctgaagtgtgaagcaccaaattactccggacggttcacgtgctcatggct^ 
acatcaagagcag^gcagttccc^ 


agaactacagcaccagcttcttcatcagggacatcatcaaaccagacccgcccaagaacttgcagatgaagccmg^^ 


attcctcatgcagcaagtgggcatgtgttccctgcagggtccgatcctagagctcgctgatcagcctcgactgtgcc 

ccctcccccgtgccnccngaccctggaaggtgccactcccactgtccmcctaataaaatgaggaaattgcatcgcattgtc^ 

ggggggtggggtggggcaggacagcaagggggaggattgggaagacaatagcaggcatgctggggatgcggtgggctctatggcttctga^^^ 
aagaaccagctggggctcgagcatgca 
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FIGURE 6 HUMAN I FN 
pIF0921 


cgttacataacttacggtaaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatagg 
gacmccattgacgtcaatgggtggagfamacggtaaactgcccacttggcagt^ 

cggtaaatggcccgcctggcattatgcccagtacatgaccttatgggactttcctacttggcagtacatctacgtattag^catcgctattaccatggtgatgcg 

gttttggcagtacatcaatgggcgtggatagcggtttgactcacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaaatca 

acgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggtaggcgtgtacggtgggaggtctatataagcagagctcgtttagtgaa 

ccgtcagatcgcctggagacgccatccacgctgttttgacctccatagaagacaccgggaccgatccagcctccgcggccgggaacggtgcattggaac 

gcggattccccgtgttaattaacaggtaagtgtcttcctcctgtttccttcccctgctattctgctcaaccttcctatcagaaactgcagtatctgtatttttgctagca 

gmtactaacggttcttttmctcttcacaggccaccatggccttgaccm 

ctgtgatctgcctcaaacccacagcctgggtagcaggaggaccttgatgctcctggcacagatgaggagaatctctcttttctcctgcttgaaggacagacat 

gactttggatttccccaggaggagtttggcaaccagttccaaaaggctgaaaccatccctgtcctccatgagatgatccagcagatcttcaatctcttcagcac 

aaaggactcatctgctgcttgggatgagaccctcctagacaaattctacactgaactctaccagcagctgaatgacctggaagcctgtgtgatacagggggt 

gggggtgacagagactcccctgatgaaggaggactccattctggctgtgaggaaatacttccaaagaatcactctctatctgaaagagaagaaatacagcc 

cngtgcctgggaggttgtcagagcagaaatcatgagatctttttctttgtcaacaaacttgcaagaaagtttaagaagtaaggaatgaatctagaaaagccga 

attctgcaggaattgggtggcatccctgtgacccctccccagtgcctctcctggccctggaagttgccactccagtgcccaccagccttgtcctaataaaatta 

agngcatcattttgtctgactaggtgtccttctataatattatggggtggaggggggtggtatggagcaaggggcaagttgggaagacaacctgtagggctc 

gagggggggcccggtaccagctragttccctttagtgagggttaatttcgagcttggcgtaatcatggtcatagctgtttcctgtgtgaaattgttatccgctca 

caanccacacaacatacgagccggaagcataaagtgtaaagcctggggtgcctaatgagtgagctaactcacattaattgcgttgcgctcactgcccgcttt 

ccagtcgggaaacctgtcgtgccagctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgctcactg 

actcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggataacgcaggaaagaa 

catgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcg^gctggcgttmccataggctccgcccccctgacgagcatcacaaaaat 

cgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccctgcc 

gctUccggatacctgtccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagct 

gggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggc 

agcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacggctacactagaaggacagtat 

ttgg^tctgcgctctgctgaagccagtteccttcggaaaaagagttgg^ 

agcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagaagaactcgtcaagaaggcgatagaag 
gcgatgcgctgcgaatcgggagcggcgataccgtaaagcacgaggaagcggtcagcccattcgccgccaagctcttcagcaatatcacgggtagccaa 
cgctatgtcctgatagcggtccgccacacccagccggccacagtcgatgaatccagaaaagcggccattttccaccatgatattcggcaagcaggcatcgc 
catgcgtcacgacgagatcctcgccgtcgggcatgcgcgccttgagcctggcgaacagttcggctggcgcgagcccctgatgctcncgtccagatcatcc 
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tgatcgscaagaccggeuccatccgagtec^^^ 
ccgcangcatcagcea.ga.ggaucmctcgg^ 

ccgcttc a gtgacaaegtcgagcacagct6cgcaaggaacgccegtcgtggccagccacgatagccgcgctgcc,cgtcctgca^ 
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Figure 7 A IFNnlas 
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Figure 7B IVN^ 


DNA coding sequence for IFN-oc2b gene in pIF0921 

ATGGCCTTGACCTTTGCTTTACTGGTGGCCCTCCTGGTGCTCAGCTGCAAGTCAAGCTGCTCTGTGGGCTGTG 

ATCT^CCTCAAACCCACAGCCTGGGTAGCAGGAGGACCTTGATGCTCCTGGCACAGATGAGGAGAATCTCTCT 

TTTCTCCTGCTTGAAGGACAGACATGACTTTGGATTTCCCCAGGAGGAGTTTGGCAACCAGTTCCAAAAGGCT 

GAAACCATCCCTGTCCTCCATGAGATGATCCAGCAGATCTTCAATCTCTTCAGCACAAAGGACTCATCTGCTG 

CTTGGGATGAGACCCTCCTAGACAAATTCTACACTGAACTCTACCAGCAGCTGAATGACerGGAAGCCTGTGT 

GATACAGGGGGTGGGGGTGACAGAGACTCCCCTGATGAAGGAGGACTCCATTCTGGCTGTGAGGAAATACTTC 

CAAAGAATCACTCTCTATCTGAAAGAGAAGAAATACAGCCCTTGTGCCTGGGAGGTTGTCAGAGCAGAAATCA 
TGAGATCTTTTTCTTTGTCAACAAACTTGCAAGAAAGTTTAAGAAGTAAGGAATGA 
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Figure 9 
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Sequence Listing Part 

<110> NORDSTROM, JEFF; PERICLE, FEDERICA; ROLLAND, 

ALLAIN; RALSTON, ROBERT 

<120> INTERFERON ALPHA PLASMIDS AND DELIVERY SYSTEMS , 

5 AND METHODS OF MAKING AND USING THE SAME 

<150> US 08/949,160 and PCT/US97 /1877 9 

<151> October 10, 1997 

<160> 25 

<210> 1 

10 <211> 328 

<212> amino acid 

<400> 1 

Met Cys His Gin Gin Leu Val lie Ser Trp Phe Ser Leu Val Phe Leu 

15 1 5 10 15 

Ala Ser Pro Leu Val Ala lie Trp Glu Leu Lys Lys Asp Val Tyr Val 
20 25 30 

20 Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
35 40 45 

Thr Cys Asp Thr Pro Glu Glu Asp Gly lie Thr Trp Thr Leu Asp Gin 
50 55 60 

25 

Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr lie Gin Val Lys 
65 70 75 80 

Glu Phe Gly Asp Ala Gly Gin Tyr Thr Cys His Lys Gly Gly Glu Val 
30 85 90 95 

Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly He Trp 
100 105 HO 

35 Ser Thr Asp He Leu Lys Asp Gin Lys Glu Pro Lys Asn Lys Thr Phe 
115 120 125 

Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
130 135 140 

40 

Leu Thr Thr He Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145 150 155 160 

Gly Ser Ser Asp Pro Gin Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
45 165 170 175 

Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
180 185 190 
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Cys Gin Glu Asp Ser Ala Cys Pro Ala Ala n„ r-i o 

195 y onn lu Glu Ser Leu Pro He 

^ 00 205 

«u Val Met val Asp Ala val His Lys Leu Lys Tyr clu Asn Tyr Thr 

^j.^ 220 
Ser Ser Phe Phe Ile Arg Asp Ile rle Lys ^ Asp ^ ^ ^ ^ 

Leu 01 „ Leu Lys p Leu Lys Asn v ^ s ^ 2 ™ 

255 

ClU T Vr Pro A 3p Thr Trp ser t ^ t ^ 

265 270 
P h e Cys Val 61B Val Gln Gly ^ ser Lys ^ g ^ ^ g 

Val Pfce Thr Asp Lys Thr s„ Ala Thr Val Ile Cys ^ Lys ^ 

300 

Ser lle ser Val Arg Ala Gin Asp Arg Tyr Tyr Ser Ser Ser Tcp ^ 


315 

Glu Trp Ala Ser Val Pro Cys Ser 
325 

<210> 2 
<211> 987 
<212> nucleic acid 

<400> 2 


320 


SSSSE SSSSS SESSS SSS TTTTTCTGGC ATCTCCCCTC 60 

GCCCCTGGAG AAATGGTGGT Cct2££S£ SJS^SSS ^ TTGGAT TG GTATCCGGAT 120 

ACCTTGGACC AGAGCAGTGA GGTCTTAGGC f^Si™* 1 ™ T ATCACCTGG ieo 

GAGTTTGGAG ATGCTGGCCA GTACACCTg5 CCCTGACCAT CCAAGTCAAA 24 0 

CTCCTGCTGC TTCACAAAAA GGAM^SS SSSSSF™ GCGA GGTTCT AAGCCATTCG 300 

AAAGAACCCA AAAATAAGAC CTTTCTAAGA 5£££££ CTGATATTTT AAAGGACCAG 360 

ACCTGCTGGT GGCTGACGAC AATCAGTArT SSSS?** AGAATTATTC TGGACGTTTC 420 

GGCTCTTCTG ACCCCCAAGG otSSJS ^ISSST TCAGTGTCAA AAGCAGCAGA 480 

AGAGGGGACA ACAAGGAGTA ^A^A^CA CACTCTCTGC AGAGAGAGTC 540 

GCTGCTGAGG AGAGTCTGCC 2SS£Stc SgG^Sat? ?™S GACAG TGCG TGCCCA 600 

GAAAACTACA CCAGCAGCTT CTTCATCAGT ?Ir55S™ CCGTTCACAA GCTCAAGTAT 660 

TTGCAGCTGA AGCCATTAAA SaISctcS CAG^I^ JJ££ TGACCC ACG CAAGAAC 720 

ACCTGGAGTA CTCCACATTC CTACTTCTCC SSSSSS TCAGCTGGGA GTACCCTGAC 780 

AGCAAGAGAG AAAAGAAAGA SgAgTot? ^CCtrllrl GCGTTCAGG *T CCAGGGCAAG 840 

CGCAAAAATG CCAGCATTAG CGTGCG^C ?CTC*GCC*C GGTCATCTGC 900 

GAATGGGCAT CTGTGCCCTG CAGTTAG CAGGACCGCT ACTATAGCTC ATCTTGGAGC 960 
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3 


<210> 3 

<211> 987 

<212> nucleic acid 

<400> 3 

5 

ATGTGCCACC AGCAGCTGGT GATCAGCTGG TTCAGCCTGG TGTTCCTGGC CAGCCCCCTG 60 

GTGGCCATCT GGGAGCTGAA GAAGGACGTG TACGTGGTGG AGCTGGACTG GTACCCCGAC 120 

GCCCCCGGCG AGATGGTGGT GCTGACCTGC GACACCCCCG AGGAGGACGG CATCACCTGG 180 

ACCCTGGACC AGAGCAGCGA GGTGCTGGGC AGCGGCAAGA CCCTGACCAT CCAGGTGAAG 24 0 

10 GAGTTCGGCG ACGCCGGCCA GTACACCTGC CACAAGGGCG GCGAGGTGCT GAGCCACAGC 300 

CTGCTGCTGC TGCACAAGAA GGAGGACGGC ATCTGGAGCA CCGACATCCT GAAGGACCAG 360 

AAGGAGCCCA AGAACAAGAC CTTCCTGCGC TGCGAGGCCA AGAACTACAG CGGCCGCTTC 4 20 

ACCTGCTGGT GGCTGACCAC CATCAGCACC GACCTGACCT TCAGCGTGAA GAGCAGCAGG 4 80 

GGCAGCAGCG ACCCCCAGGG CGTGACCTGC GGCGCCGCCA CCCTGAGCGC CGAGCGCGTG 54 0 

15 CGCGGCGACA ACAAGGAGTA CGAGTACAGC GTGGAGTGCC AGGAGGACAG CGCCTGCCCC 600 

GCCGCCGAGG AGAGCCTGCC CATCGAGGTG ATGGTGGACG CCGTCCACAA GCTGAAGTAC 660 

GAGAACTACA CCAGCAGCTT CTTCATCCGC GACATCATCA AGCCCGACCC CCCCAAGAAC 720 

CTGCAGCTGA AGCCCCTGAA GAACAGCCGC CAGGTGGAGG TGAGCTGGGA GTACCCCGAC 780 

ACCTGGAGCA CCCCCCACAG CTACTTCAGC CTGACCTTCT GCGTGCAGGT GCAGGGCAAG 84 0 

20 AGCAAGCGCG AGAAGAAGGA CCGCGTGTTC AC CG ACAAGA CCAGCGCCAC CGTGATCTG C 900 

CGCAAGAACG CCAGCATCAG CGTGCGCGCC CAGGACCGCT ACTACAGCAG CAGCTGGAGC 960 

GAGTGGGCCA GCGTGCCCTG CAGCTAG 907 

<210> 4 

25 <211> 987 

<212> nucleic acid 

<400> 4 

ATGTGCCACC AGCAGCTGGT GATCAGCTGG TTCTCCCTGG TGTTTCTGGC CAGCCCCCTC 60 

30 GTGGCCATCT GGGAGCTGAA GAAAGACGTG TACGTGGTCG AGCTGGACTG GTACCCCGAC 120 

GCCCCCGGCG AGATGGTGGT CCTGACCTGC GACACCCCCG AGGAAGACGG CATCACCTGG 180 

ACCCTGGACC AGAGCAGTGA GGTGCTGGGC TCCGGCAAGA CCCTGACCAT CCAGGTGAAG 24 0 

GAGTTCGGCG ACGCCGGCCA GTACACCTGC CACAAGGGAG . GCGAGGTGCT GAGCCACTCC 300 

CTCCTGCTGC TCCACAAAAA GGAGGACGGC ATCTGGAGCA CCGACATCCT GAAGGACCAG 360 

35 AAGGAGCCCA AGAACAAGAC CTTCCTGCGC TGCGAGGCCA AGAACTACAG CGGCCGCTTC 4 20 

ACCTGCTGGT GGCTGACCAC GATCAGCACC GACCTGACCT TCAGTGTGAA GAGCAGCAGG 4 80 

GGCTCCAGCG ACCCCCAGGG CGTGACCTGC GGCGCTGCCA CCCTGAGCGC CGAGCGCGTG 54 0 

CGCGGCGACA ACAAGGAGTA CGAGTACAGC GTGGAGTGCC AGGAAGACTC CGCCTGCCCC 600 

GCCGCTGAGG AGAGCCTGCC CATCGAGGTG ATGGTGGACG CCGTTCACAA GCTGAAGTAC 660 

4 0 GAGAACTACA CCAGCAGCTT CTTCATCCGC GACATCATCA AGCCTGACCC ACCCAAGAAC 720 

CTCCAGCTGA AGCCCCTCAA GAACTCCCGC CAGGTGGAGG TGAGCTGGGA GTACCCCGAC 780 

ACCTGGAGCA CGCCCCACTC CTACTTCTCC CTGACCTTCT GCGTGCAGGT CCAGGGCAAG 84 0 

AGCAAGCGCG AGAAGAAAGA CCGGGTGTTC ACCGACAAGA CCAGCGCCAC CGTCATCTGC 900 

CGCAAGAACG CCAGCATCAG CGTGCGCGCC CAGGACCGCT ACTATAGCTC CTCTTGGAGC 960 

4 5 GAGTGGGCCA GCGTGCCCTG CTCCTAG 987 

<210> 5 

<211> 219 
<212> amino acid 

50 <400> 5 

Met Cys Pro Ala Arg Ser Leu Leu Leu Val Ala Thr Leu Val Leu Leu 

15 10 15 
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Asp His Leu Ser Leu Ala Arg Asn Leu Pro Val Ala Thr Pro Asp Pro 
20 25 30 

s Gly Met Phe Pro Cys Leu His His Ser Gin Asn Leu Leu Arg Ala Val 


40 


45 


Ser Asn Met Leu Gin Lys Ala Arg Gin Thr Leu Glu Phe Tyr Pro Cys 
ou 55 60 

10 Thr Ser Glu Glu He Asp His Glu Asp He Thr Lys Asp Lys Thr Ser 

70 75 80 

Thr Val Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys Asn Glu Ser Cys 
15 85 90 95 

Leu Asn Ser Arg Glu Thr Ser Phe lie Thr Asn Gly Ser Cys Leu Ala 
U 105 110 

Ser Arg Lys Thr Ser Phe Met Met Ala Leu Cys Leu Ser Ser lie Tyr 
115 120 12 5 

Glu Asp Leu Lys Met Tyr Gin Val Glu Phe Lys Thr Met Asn Ala Lys 

135 140 

25 Leu Leu Met Asp Pro Lys Arg Gin lie Phe Leu Asp Gin Asn Met Leu 

150 155 16Q 

Ala Val He Asp Glu Leu Met Gin Ala Leu Asn Phe Asn Ser Glu Thr 
30 165 170 175 

Val Pro Gin Lys Ser Ser Leu Glu Glu Pro Asp Phe Tyr Lys Thr Lys 


He Lys Leu Cys lie Leu Leu His Ala Phe Arg He Arg Ala Val Thr 
195 200 205 

He Asp Arg Val Thr Ser Tyr Leu Asn Ala Ser 
210 215 

40 <210> 6 

<211> 660 

<212> nucleic acid 

<400> 6 

45 ?^cSSa nnS^ CCT CCTCCTT <^G GCTACCCTGG TCCTCCTGGA CCACCTCACT 60 
£™ CCCCGT GGCCACTCCA GACCCAGGAA TGTTCCCATG CCTTCACCAC 120 
?T?SS ™™ CGTCAGCAAC ATGCTCCAGA AGGCCAGACA AActSaGA^ 180 
GCACTTCTGA AGAGATTGAT CATGAAGATA TCACAAAAGA TAAAACCAPT iSn 
50 r^tl^S. G CCTGT TTACC ATTGGAATTA ACCAAGAATG AGAGTTGCCT Ia^cSga Inn 
SSS25£ TCATAACTAA TGGGAGTTGC CTGGCCTCCA GAAAGACCTC TOmSS 360 

atpaIS?** IJ£SI agtat ™tgaagac ttgaagatgt accaggtgga gttc^gacc III 

rr^lr^ AGCTTCTGAT GGATCCTAAG AGGCAGATCT TTCTAGATCA AAACATGCTG \ 11 

?C^™g iSSEKS ^° CC7G ^TCAACA GTGAgS J« 

TG AAGAACCGGA TTTTTATAAA ACTAAAATCA AGCTCTGCAT ACTTCTTrAT Inn 

55 GCTTTCAGAA TTCGGGCAGT GACTATTGAC AGAGTGACGA GCTATCTGAA tSSSS 660 


BNSOOCID <WO._ 9947676A2 I 
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<210> 7 

<211> 660 

<212> nucleic acid 

<400> 7 

5 

ATGTGCCCCG CCCGCAGCCT GCTGCTGGTG GCCACCCTGG TGCTGCTGGA CCACCTGAGC €0 

CTGGCCCGCA ACCTGCCCGT GGCCACCCCC GACCCCGGCA TGTTCCCCTG CCTGCACCAC 120 

AGCCAGAACC TGCTGGCGGC CGTGAGCAAC ATGCTGCAGA AGGCCGCGCA GACCCTGGAG 180 

TTCTACCCCT GCACCAGCGA GGAGATCGAC CACGAGGACA TCACCAAGGA CAAGACCAGC 24 0 

10 ACCGTGGAGG CCTGCCTGCC CCTGGAGCTG ACCAAGAACG AGAGCTGCCT GAACAGCCGC 300 

GAGACCAGCT TCATCACCAA CGGCAGCTGC CTGGCCAGCC GCAAGACCAG CTTCATGATG 360 

GCCCTGTGCC TGAGCAGCAT CTACGAGGAC CTGAAGATGT ACCAGGTGGA GTTCAAGACC 420 

ATGAACGCCA AGCTGCTGAT GGACCCCAAG CTCCAGATCT TCCTGGACCA GAACATGCTG 4 80 

GCCGTGATCG ACGAGCTGAT GCAGGCCCTG AACTTCAACA GCGAGACCGT GCCCCAGAAG 540 

15 AGCAGCCTGG AGGAGCCCGA CTTCTACAAG ACCAAGATCA AGCTGTGCAT CCTGCTGCAC 600 

GCCTTCCGCA TCCGCGCCGT GACCATCGAC CGCGTGACCA GCTACCTGAA CGCCACCTGA 660 

<210> 8 
<211> 660 
20 <212> nucleic acid 

<400> 8 

ATGTGCCCCG CCCGCAGCCT GCTGCTGGTG GCCACCCTGG TGCTCCTGGA CCACCTCAGC 60 

CTGGCCCGCA ACCTCCCCGT GGCCACCCCA GACCCCGGCA TGTTCCCATG CCTGCACCAC 120 

2 5 AGCCAGAACC TGCTGGCGGC CGTGAGCAAC ATGCTGCAGA AGGCCGCGCA GACCCTGGAG 180 

TTCTACCCCT GCACCAGCGA GGAGATCGAC CACGAGGACA TCACCAAGGA CAAGACCAGC 24 0 

ACCGTGGAGG CCTGCCTGCC CCTCGAGTTA ACCAAGAACG AGAGCTGCCT CAACAGCCGC 300 

GAGACCTCCT TCATCACCAA CGGCACTTGC CTGGCCTCCC GCAAGACCAG CTTCATGATG 360 

GCCCTGTGCC TGAGCTCCAT CTACGAGGAC CTGAAGATGT ACCAGGTGGA GTTCAAGACC 420 

30 ATGAACGCCA AGCTCCTGAT GGACCCCAAG CTCCAGATCT TCCTGGACCA GAACATGCTG 480 

GCCGTGATCG ACGAGCTGAT GCAGGCCCTG AACTTCAACA GCGAGACCGT GCCCCAGAAG 54 0 

AGCAGCCTGG AGGAGCCCGA CTTCTACAAG ACCAAGATCA AGCTGTGCAT CCTGCTGCAC 600 

GCCTTCCGCA TCCGGGCCGT GACCATCGAC CGCGTGACCA GCTACCTGAA CGCCACGTGA 660 

35 <210> 9 

<211> 188 

<212> amino acid 

<400> 9 


4 0 Met Ala Leu Thr Phe Ala Leu Leu Val Ala Leu Leu Val Leu Ser Cys 


10 


15 


45 


Lys Ser Ser Cys Ser Val Gly Cys Asp Leu Pro Gin Thr His Ser Leu 


20 


25 


30 


Gly Ser Arg Arg Thr Leu Met Leu Leu Ala Gin Met Arg Arg lie Ser 


35 


40 


45 


50 


Leu Phe Ser Cys Leu Lys Asp Arg His Asp Phe Gly Phe Pro Gin Glu 


50 


55 


60 


Glu Phe Gly Asn Gin Phe Gin Lys Ala Glu Thr lie Pro Val Leu His 


65 


70 


75 


80 
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Glu Met He Gin Gin He Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser 
85 90 95 

Ala Ala Trp Asp Glu Thr Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tvr 
100 105 110 Y 

Gin Gin Leu Asn Asp Leu Glu Ala Cys Val He Gin Gly Val Gly Val 
115 120 125 

Thr Glu Thr Pro Leu Met Lys Glu Asp Ser He Leu Ala Val Arg Lys 
1JU 135 140 

15 III ^ IlG LSU LyS G1U LyS Lys T * r Ser Pr ° 

150 155 i6o 

Cys Ala Trp Glu Val Val Arg Ala Glu He Met Arg Ser Phe Ser Leu 

i*7rt - 175 


10 


20 


165 170 

Ser Thr Asn Leu Gin Glu Ser Leu Arg Ser Lys Glu 
180 i 8 5 

<210> 10 

<211> 567 

25 <212> nucleic acid 

<400> 10 


<210> H 

40 <211> 567 

<212> nucleic acid 

<400> 11 


60 
120 
180 


™SSSI TGA CCT TTGCTTT ACTGGTGGCC CTCCTGGTGC TCAGCTGCAA GTCAAGCTGC 
^ JSJSSSP GTGATCTG CC TCAAACCCAC AGCCTGGGTA GCAGGAGGAC CTtcJ5S?c 
30 S?c™? TSSS?™ CTCTCTTTTC TCCTGCTTGA AGGACAGACA iSS 

TTTCCCCAGG AGGAGTTTGG CAACCAGTTC CAAAAGGCTG AAACCATCCC TGTCCTCCAT 24 0 

AGCAGATCTT CAATCTCTTC AGCACAAAGG ACTCAT CTGC 300 

G ?S^S CTCC TAGA CAAATT CTACACTGAA CTCTACCAGC AGCTGAATGA CCTGGAAGCC 360 

35 GCTGTGAGGA JSS^ GGTGACAGAG A CTCCCCTGA TGAAGGAgSa CTC^CTG 420 

° 2251? AGGA AATACTTCCA AAGAATCACT CTCTATCTGA AAGAGAAGAA ATACAGCCCT 4 80 

szsess ^sssss sr c atgagatctt tttctttgtc "° 


A e A J55 CCCTGA CCTTCGCCCT GCTGGTGGCC CTGCTGGTGC TGAGCTGCAA GAGCAGCTGC* fin 

CTGGCCC^I rrr^ C ° CCAGACCCAC AGCCTGGGGA GCCGGCGGAC 120 

CTGGCCCAGA TGCGGCGGAT CAGCCTGTTC AGCTGCCTGA AGGACCGGCA CGACTTCGGr inn 

«r2SSS G AGGAGTTCGG GAACCAGTTC CAGAAGGCCG AGACCATCCC CG^Sgc^C 240 

GAGArrrrrr AGCA GATCTT CAACCTGTTC AGCACCAAGG ACAGCAGCGC CGCC^Sc 300 
GAGACCCTGC TGGACAAGTT CTACACCGAG CTGTACCAGC AG CTGAACGA crTrJ-firSr-n 

50 ™S^ATCC AGGGGGTGGG GGTGACCGAG ACCCCCCTGA TGAAGGAGGA Sg^TC^G 420 

^l^ GGA AGTACTTCCA GCGGATCACC CTGTACCTGA AGGAGAAGAA GTaSccSc Ml 

SSSSS Sr ATGCGGAGCT TCAGCCTGAG SSSS S!S 


BNSDOC1D <WO 9&47678A2 I > 
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7 


10 


15 


<210> 
<211> 
<212> 
<223> 


<400> 

ATGGCNYTNA 
WSNGTNGGNT 
YTNGCNCARA 
TTYCCNCARG 
GARATGATHC 
GARACNYTNY 
TGYGTNATHC 
GCNGTNMGNA 
TGYGCNTGGG 
CARGARWSNY 


12 
567 

nucleic acid 
"Y" stands for 
"W" stands for 
stands for any 
12 


C or T; "R" 
T or A; "S" 
base . 


stands for A or G; 
stands for C or G; "N" 


CNTTYGCNYT 
GYGAYYTNCC 
TGMGNMGNAT 
ARGARTTYGG 
ARCARATHTT 
TNGAYAARTT 
ARGGNGTNGG 
ARTAYTTYCA 
ARGTNGTNMG 
TNMGNWSNAA 


NYTNGTNGCN 
NCARACNCAY 
HWSNYTNTTY 
NAAYCARTTY 
YAAYYTNTTY 
YTAYACNGAR 
NGTNACNGAR 
RMGNATHACN 
NGCNGARATH 
RGARTRR 


YTNYTNGTNY 
WSNYTNGGNW 
WSNTGYYTNA 
CARAARGCNG 
WSNACNAARG 
YTNTAYCARC 
ACNCCNYTNA 
YTNTAYYTNA 
ATGMGNWSNT 


TNWSNTGYAA 
SNMGNMGNAC 
ARGAYMGNCA 
ARACNATHCC 
AYWSNWSNGC 
ARYTNAAYGA 
TGAARGARGA 
ARGARAARAA 
TYWSNYTNWS 


RWSNWSNTGY 
NYTNATGYTN 
YGAYTTYGGN 
NGTNYTNCAY 
NGCNTGGGAY 
YYTNGARGCN 
YWSNATHYTN 
RTA^YWSNCCN 
NACNAAYYTN 


60 
120 
180 
240 
300 
360 
420 
480 
540 
567 


20 


25 


<210> 

<211> 
<212> 
<400> 


13 
191 

nucleic acid 
13 


GGGTGGCATC CCTGTGACCC CTCCCCAGTG CCTCTCCTGG CCCTGGAAGT TGCCACTCCA 
GTGCCCACCA GCCTTGTCCT AATAAAATTA AGTTGCATCA TTTTGTCTGA CTAGGTGTCC 
TTCTATAATA TTATGGGGTG GAGGGGGGTG GTATGGAGCA AGGGGCAAGT TGGGAAGACA 
ACCTGTAGGG C 


60 
120 
180 
191 


30 


<210> 
<211> 
<212> 
<400> 


14 
58 

nucleic acid 
14 


35 AAGCTTACTC AACACAATAA CAAACTTACT TACAATCTTA ATTAACAGGC CACCATGG 


40 


<210> 

<211> 
<212> 
<400> 


15 
15 

nucleic acid 
15 


CAGGT7AAGTG TCTTC 


15 
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<210> 
<211> 
<212> 


16 
30 

nucleic acid 


10 


15 


20 


<400> 16 

TACTAACGGT TCTTTTTTTC TCTTCACAGG 
<210> 
<211> 
<212> 
<400> 

TCTAGAGCAT TTTTCCCTCT rrr^^, mm 

TGACGTCTGG CTaI£££ AAATTTATTT SSS***** ^GCATC 

£^£ CACT CGGT ACTAGA GCATTTTTCC CTCTrr^I ^1^^ AA^TTTTTGT 
GCCCCTTGAG CATCTGACGT CTG G CT AATA S^AflfS^ AATTATGGGG ACATCATGAA 
TTGGAATTTT TTGTGTCTCT CACTCGGTAC £ AGGAflATTT ATTTTCATTG CAATAGTGTG 


17 

271 

nucleic acid 
17 


25 


<210> 
<211> 
<212> 
<400> 


CCGGCCACAG 
GGCATCGCCA 
GAACAGTTCG 
ACCGGCTTCC 
GCAGGTAGCC 
CTCGGCAGGA 
J U CCAGTCCCTT 
GGCCAGCCAC 
GGTCTTGACA 
GCAGCCGATT 
AGAACCTGCG 

3 5 ATCAGATCTT 

TTTGCAGGGC 
TACATAACTT 
GTCAATAATG 
GGTGGAGTAT 

4 0 TACGCCCCCT 

GACCTTATGG 
GGTGATGCGG 
TCCAAGTCTC 
CTTTCCAAAA 
4 ^ GTGGGAGGTC 
TCCACGCTGT 
ACGGTGCATT 
TCCTTCCCCT 
TAGCAGTAAT 
-> 0 GCCTCCTCCT 
CCGTGGCCAC 
GGGCCGTCAG 
CTGAAGAGAT 
TACCATTGGA 
~> 5 CTAATGGGAG 
GTATTTATGA 
TGATGGATCC 


18 

5686 

nucleic acid 
18 


TCGATGAATC 
TGCGTCACGA 
GCTGGCGCGA 
ATCCGAGTAC 
GGATCAAGCG 
GCAAGGTGAG 
CCCGCTTCAG 
GATAGCCGCG 
AAAAGAACCG 
GTCTGTTGTG 
TGCAATCCAT 
GATCCCCTGC 
TTCCCAACCT 
ACGGTAAATG 
ACGTATGTTC 
TTACGGTAAA 
ATTGACGTCA 
GACTTTCCTA 
TTTTGGCAGT 
CACCCCATTG 
TGTCGTAACA 
TATATAAGCA 
TTTGACCTCC 
GGAACGCGGA 
GCTATTCTGC 
ACTAACGGTT 
TGTGGCTACC 
TCCAGACCCA 
CAACATGCTC 
TGATCATGAA 
ATTAACCAAG 
TTGCCTGGCC 
AGACTTGAAG 
TAAGAGGCAG 


CAGAAAAGCG 
CGAGATCCTC 
GCCCCTGATG 
GTGCTCGCTC 
TATGCAGCCG 
ATGACAGGAG 
TGACAACGTC 
CTGCCTCGTC 
GGCGCCCCTG 
CCCAGTCATA 
CTTGTTCAAT 
GCCATCAGAT 
TACCAGAGGG 
GCCCGCCTGG 
CCATAGTAAC 
CTGCCCACTT 
ATGACGGTAA 
CTTGGCAGTA 
ACATCAATGG 
ACGTCAATGG 
ACTCCGCCCC 
GAGCTCGTTT 
ATAGAAGACA 
TTCCCCGTGT 
TCAACCTTCC 
CTTTTTTTCT 
CTGGTCCTCC 
GGAATGTTCC 
CAGAAGGCCA 
GATATCACAA 
AATGAGAGTT 
TCCAGAAAGA 
ATGTACCAGG 
ATCTTTCTAG 


GCCATTTTCC 
GCCGTCGGGC 
CTCTTCGTCC 
GATG CGATGT 
CCGCATTGCA 
ATCCTGCCCC 
GAGCACAGCT 
CTGCAGTTCA 
CGCTGACAGC 
GCCGAATAGC 
CATGCGAAAC 
CCTTGGCGGC 
CGAATTCGAG 
CTGACCGCCC 
GCCAATAGGG 
GGCAGTACAT 
ATGGCCCGCC 
CATCTACGTA 
GCGTGGATAG 
GAGTTTGTTT 
ATTGACGCAA 
AGTGAACCGT 
CCGGGACCGA 
TAATTAACAG 
TATCAGAAAC 
CTTCACAGGC 
TGGACCACCT 
CATGCCTTCA 
GACAAACTCT 
AAGATAAAAC 
GCCTAAATTC 
CCTCTTTTAT 
TGGAGTTCAA 
ATCAAAACAT 


ACCATGATAT 
ATGCGCGCCT 
AGATCATCCT 
TTCGCTTGGT 
TCAGCCATGA 
GGCACTTCGC 
GCGCAAGGAA 
TTCAGGGCAC 
CGGAACACGG 
CTCTCCACCC 
GATCCTCATC 
AAGAAAGCCA 
CTTGCATGCC 
AACGACCCCC 
ACTTTCCATT 
CAAGTGTATC 
TGGCATTATG 
TTAGTCATCG 
CGGTTTGACT 
TGGCACCAAA 
ATGGGCGGTA 
CAGATCGCCT 
TCCAGCCTCC 
GTAAGTGTCT 
TGCAGTATCT 
CACCATGGGT 
CAGTTTGGCC 
CCACTCCCAA 
AGAATTTTAC 
CAGCACAGTG 
CAGAGAGACC 
GATGGCCCTG 
GACCATGAAT 
GCTGGCAGTT 


TCGGCAAGCA 
TGAGCCTGGC 
GATCGACAAG 
GGTCGAATGG 
TGGATACTTT 
CCAATAGCAG 
CGCCCGTCGT 
CGGACAGGTC 
CGGCATCAGA 
AAGCGGCCGG 
CTGTCTCTTG 
TCCAGT ttac 
TGCAGGTCGT 
GCCCATTGAC 
GACGTCAATG 
ATATGCCAAG 
CCCAGTACAT 
CTATTACCAT 
CACGGGGATT 
ATCAACGGGA 
GGCGTGTACG 
GGAGACGCCA 
GCGGCCGGGA 
TCCTCCTGTT 
GTATTTTTGC 
CCAGCGCGCA 
AGAAACCTCC 
AACCTGCTGA 
CCTTGCACTT 
GAGGCCTGTT 
TCTTTCATAA 
TGCCTTAGTA 
GCAAAGCTTC 
ATTGATGAGC 


60 
120 
180 
240 

271 


60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 


30 
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9 


TGATGCAGGC CCTGAATTTC AACAGTGAGA 
CGGATTTTTA TAAAACTAAA ATCAAGCTCT 
CAGTGACTAT TGATAGAGTG ATGAGCTATC 
GAATTCTGCA GGAATTGGGT GGCATCCCTG 
5 GGAAGTTGCC ACTCCAGTGC CCACCAGCCT 
GTCTGACTAG GTGTCCTTCT ATAATATTAT 
GCAAGTTGGG AAGACAACCT GTAGGGCTCG 
TTACGGTAAA TGGCCCGCCT GGCTGACCGC 
TGACGTATGT TCCCATAGTA ACGCCAATAG 
10 ATTTACGGTA AACTGCCCAC TTGGCAGTAC 
CTATTGACGT CAATGACGGT AAATGGCCCG 
GGGACTTTCC TACTTGGCAG TACATCTACG 
GGTTTTGGCA GTACATCAAT GGGCGTGGAT 
TCCACCCCAT TGACGTCAAT GGGAGTTTGT 
15 AATGTCGTAA CAACTCCGCC CCATTGACGC 
TCTATATAAG CAGAGCTCGT TTAGTGAACC 
GTTTTGACCT CCATAGAAGA CACCGGGACC 
TTGGAACGCG GATTCCCCGT GTTAATTAAC 
CTGCTATTCT GCTCAACCTT CCTATCAGAA 
20 TACTAACGGT TCTTTTTTTC TCTT CACAGG 
TCTCTTGGTT TTCCCTGGTT TTTCTGGCAT 
AAGATGTTTA TGTCGTAGAA TTGGATTGGT 
TCACCTGTGA CACCCCTGAA GAAGATGGTA 
TCTTAGGCTC TGGCAAAACC CTGACCATCC 
25 ACACCTGTCA CAAAGGAGGC GAGGTTCTAA 
AAGATGGAAT TTGGTCCACT GATATTTTAA 
TTCTAAGATG CGAGGCCAAG AATTATTCTG 
TCAGTACTGA TTTGACATTC AGTGTCAAAA 
TGACGTGCGG AGCTGCTACA CTCTCTGCAG 
30 AGTACTCAGT GGAGTGCCAG GAGGACAGTG 
TTGAGGTCAT GGTGGATGCC GTTCACAAGC 
TCATCAGGGA CAT CATCAAA CCTGACCCAC 
ATTCTCGGCA GGTGGAGGTC AGCTGGGAGT 
ACTTCTCCCT GACATTCTGC GTTCAGGTCC 
35 GAGTCTTCAC GGACAAGACC TCAGCCACGG 
TGCGGGCCCA GGACCGCTAC TATAGCTCAT 
GTTAGACGCG CTAGAAAAGC CGAATTCTGC 
CCCAGTGCCT CTCCTGGCCC TGGAAGTTGC 
AAAATTAAGT TGCATCATTT TGTCTGACTA 
40 GGGGGTGGTA TGGAGCAAGG GGCAAGTTGG 
CCGGTACCAG CTTTTGTTCC CTTTAGTGAG 
CATAGCTGTT TCCTGTGTGA AATTGTTATC 
GAAGCATAAA GTGTAAAGCC TGGGGTGCCT 
TGCGCTCACT GCCCGCTTTC CAGTCGGGAA 
4 5 GCCAACGCGC GGGGAGAGGC GGTTTGCGTA 
ACTCGCTGCG CTCGGTCGTT CGGCTGCGGC 
TACGGTTATC CACAGAATCA GGGGATAACG 
AAAAGGCCAG GAACCGTAAA AAGGCCGCGT 
CTGACGAGCA TCACAAAAAT CGACGCTCAA 
50 AAAGATACCA GGCGTTTCCC CCTGGAAGCT 
CGCTTACCGG ATACCTGTCC GCCTTTCTCC 
CACGCTGTAG GTATCTCAGT TCGGTGTAGG 
' AACCCCCCGT TCAGCCCGAC CGCTGCGCCT 
CGGTAAGACA CGACTTATCG CCACTGGCAG 
55 GGTATGTAGG CGGTGCTACA GAGTTCTTGA 
GGACAGTATT TGGTATCTGC GCTCTGCTGA 
GCTCTTGATC CGGCAAACAA ACCACCGCTG 
AGATTACGCG CAGAAAAAAA GGATCTCAAG 
ACGCTCAGAA GAACTCGTCA AGAAGGCGAT 
60 CGATACCGTA AAGCACGAGG AAGCGGTCAG 
CACGGGTAGC CAACGCTATG TCCTGATAGC 


CTGTGCCACA AAAATCCTCC CTTGAAGAAC 2100 

GCATACTTCT TCATGCTTTC AGAATTCGGG 2160 

TGAATGCTTC CTAACAATTC TAGAAAAGCC 2220 

TGACCCCTCC CCAGTGCCTC TCCTGGCCCT 2280 

TGTCCTAATA AAATTAAGTT GCATCATTTT 2340 

GGGGTGGAGG GGGGTGGTAT GGAGCAAGGG 2400 

AGGGGGGGCC CGGTACGGTC GTTACATAAC 24 60 

CCAACGACCC CCGCCCATTG ACGTCAATAA 2520 

GGACTTTCCA TTGACGTCAA TGGGTGGAGT 2580 

ATCAAGTGTA TCATATGCCA AGTACGCCCC 2640 

CCTGGCATTA TGCCCAGTAC ATGACCTTAT 2700 

TATTAGTCAT CGCTATTACC ATGGTGATGC 27 60 

AGCGGTTTGA CTCACGGGGA TTTCCAAGTC 2820 

TTTGGCACCA AAATCAACGG GACTTTCCAA 2880 

AAATGGGCGG TAGGCGTGTA CGGTGGGAGG 2940 

GTCAGATCGC CTGGAGACGC CATCCACGCT 3000 

GATCCAGCCT CCGCGGCCGG GAACGGTGCA 3060 

AGGTAAGTGT CTTCCTCCTG TTTCCTTCCC 3120 

ACTGCAGTAT CTGTATTTTT GCTAGAATTG 3180 

CTTAAGTCAT GGGTCACCAG CAGTTGGTCA 3240 

CTCCCCTCGT GGCCATATGG GAACTGAAGA 3300 

ATCCGGATGC CCCTGGAGAA ATGGTGGTCC 3360 

TCACCTGGAC CTTGGACCAG AGCAGTGAGG 3420 

AAGTCAAAGA GTTTGGAGAT GCTGGCCAGT 3480 

GCCATTCGCT CCTGCTGCTT CACAAAAAGG 354 0 

AGGACCAGAA AGAACCCAAA AATAAGACCT 3600 

GACGTTTCAC CTGCTGGTGG CTGACGACAA 3660 

GCAGCAGAGG CTCTTCTGAC CCCCAAGGGG 3720 

AGAGAGTCAG AGGGGACAAC AAGGAGTATG 3780 

CCTGCCCAGC TGCTGAGGAG AGTCTGCCCA 384 0 

TCAAGTATGA AAACTACACC AGCAGCTTCT 3900 

CCAAGAACTT GCAGCTGAAG CCATTAAAGA 3960 

ACCCTGACAC CTGGAGTACT CCACATTCCT 4 020 

AGGGCAAGAG CAAGAGAGAA AAGAAAGATA 4080 

TCATCTGCCG CAAAAATGCC AGCATTAGCG 414 0 

CTTGGAGCGA ATGGGCATCT GTGCCCTGCA 4200 

AGGAATTGGG TGGCATCCCT GTGACCCCTC 42 60 

CACTCCAGTG CCCACCAGCC TTGTCCTAAT 4320 

GGTGTCCTTC TATAATATTA TGGGGTGGAG 4 380 

GAAGACAACC TGTAGGGCTC GAGGGGGGGC 4 4 40 

GGTTAATTTC GAGCTTGGCG TAATCATGGT 4 500 

CGCTCACAAT TCCACACAAC ATACGAGCCG 4560 

AATGAGTGAG CTAACTCACA TTAATTGCGT 4 620 

ACCTGTCGTG CCAGCTGCAT TAATGAATCG 4 680 

TTGGGCGCTC TTCCGCTTCC TCGCTCACTG 474 0 

GAGCGGTATC AGCT CACTCA AAGGCGGTAA 4800 

CAGGAAAGAA CATGTGAGCA AAAGGCCAGC 4 860 

TGCTGGCGTT TTTCCATAGG CTCCGCCCCC 4 920 

GTCAGAGGTG GCGAAACCCG ACAGGACTAT 4 980 

CCCTCGTGCG CTCTCCTGTT CCGACCCTGC 5040 

CTTCGGGAAG CGTGGCGCTT TCTCATAGCT 5100 

TCGTTCGCTC CAAGCTGGGC TGTGTGCACG 5160 

TATCCGGTAA CTATCGTCTT GAGTCCAACC 5220 

CAGCCACTGG TAACAGGATT AGCAGAGCGA 5280 

AGTGGTGGCC TAACTACGGC TACACTAGAA 5340 

AGCCAGTTAC CTTCGGAAAA AGAGTTGGTA 5400 

GTAGCGGTGG TTTTTTTGTT TGCAAGCAGC 54 60 

AAGATCCTTT GATCTTTTCT ACGGGGTCTG 5520 

AGAAGGCGAT GCGCTGCGAA TCGGGAGCGG 5580 

CCCATTCGCC GCCAAGCTCT TCAGCAATAT 564 0 

GGTCCGCCAC ACCCAG 5686 
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10 


<210> 
<211> 
<212> 
<400> 


19 

3426 

nucleic acid 
19 


10 


15 


20 


25 


30 


35 


40 


45 


50 


55 


60 


S£S££ £S?G? JS^ST TACTACTGGT 

TGACAGTCCT GGAAGAAATG AGA^AC^C C^Z^Z^ CCTCGGGAAC AAGAGGGCCT 
ATTTTGGATT CCCCTTGGAG AA^gS^A ACcSSrl^ S^ 0 ™** 3 ^^GAAGG 
TTGTGCTAAG AGATCTTArr ra^,^ ACCAACAGAT CCAGAAGGCT CAAGCCATrr 
CTACTTGGAA TgS SUSSSI 

ATCTCAAAGC CTGTGTGATG GAGGAACCTC SSr^™ CCTCCATCAG CAGCTCAATG 
TGAGGACATA CTTCCACAGG ATCACTGTGT SSSS^ GGAAGACTCC CTGCTGGCTG 
CCTGGGAGGT GATCAGAGCA GAAGOTgS PA^SS^ GAAGAAACAC AGCCTCTGTG 
CAAGACTGAG TGAGGAGAAG G^gScta SKS^ TTCCTCAA CC AACTTGCTGG 
^SS GACCC CTCCCCAGTG CC^AA^ 1^°^^ AT7TGGCATC 

GCCTTGTCCT AATAAAATTA AGTTGCAtS m^rl JS?*™ 0 * GTGCCCACCA 
TTATGGGGTG GAGGGGGGTG GTATGGAgS IZJJZZZZ^ CTAGGTGTCC TTCTATAATA 
GGCTCGAGGG GGGGCCCGGT AcSEsSS? r^SS^ GTTGGG AAGA CAACCTGTAG 
TGGCGTAATC ATGGTCATAG CTGTTTCCTG ^rv^ZZ* GTGAGGG ™ ATTTCGAGCT 
ACAACATACG AGCCGGAAGC ATAAAgSa I5S£5£ G TTATCGGG TC ACAATTCCAC 
TCACATTAAT TGCGTTGCGC TCA^GcSg CTTTCTArrr J GCCTAATGA GTGAGCTAAC 
TGCATTAATG AATCGGCCAA CGCGCG^rra. ^IJJSS^ GGGAAACCTG TCGTGCCAGC 
CTTCCTCGCT CACTGACTCG CTGCGCTcrr S^SSS CTTT GCG TATTGGG CGCTCTTCCG 
ACTCAAAGGC GGTAATA^GG SScCACAG ISSSST GCGGCGAGGG GTATCAGCTC 
GAGCAAAAGG CCAGCAAAAG GC^GA^CC GTAAAfincrr JAACGCAGGA AAGAACATGT 
ATAGGCTCCG CCCCCCTGAC GAGCATCAPA SJISSS 6 * 30 CGCGT TGCTG GCGTTTTTCC 
ACCCGACAGG ACTATAAAGA CTCAAGTGAG AGGTGGCGA^ 

CTGTTCCGAC CCTGCCGCTT ACcSaSS ?£?r£SS iSS™ 0 **™ GT GCGCTCTC 
CGCTTTCTCA TAGCTCACGC TGTAMTATC JSSSS^ TCTCCG TTCG GGAAGCGTGG 
TGGGCTGTGT GCACGAACCC CCCGTTC^r JS^T 000 * GTAGGTCGTT CGCTCCAAGC 
GTCTTGAGTC CAACCcS^A SSESS CGCCTTATCG g GTAAC?£tc 

GGATTAGCAG AGCGAGGTAT G^AGGCGgS SSS" 50 " 300 AG TGGTAACA 

ACGGCTACAC TAGAAGGACA GTATTTGGTA ^TCrrrvZZ Z™**™* TGGCCTAACT 
GAAAAAGAGT TGGTAGCTCT TGATCCGGcS I2S2£S GCTGAAGCCA GTTACCTTCG 
TTGTTTGCAA GCAGCAGATT AOGCGoSS iJSi??* 0 CGCTGG TAGC GGTGGTTTTT 
TTTCTACGGG GTCTGACGCT PirSSSSS AAAAAG GATC TCAAGAAGAT CCTTTGATCT 
GCGAATCGGG AgSgcSaS CCG^S crl^™ G GcSSSSS 
GCTCTTCAGC AATATCACGG GTarrranr^ CGAGGAAGCG GTCAGCCCAT TCGCCGCCAA 
GCCGGCCACA CTCGaSSS S^SSiS SI ATGTCCTG ATAGCGGTCC GCCACACCct 
AGGCATCGCC ATGcS 2£££S2g SSSSS^ OCCM « W TTCgScaSgC 
CGAACAGTTC GGCTGGCGCG AGoSSJS? SSSSSS CATGCGGGG C TTGAGCCTGG 
GACCGGCTTC CATCCGAGTA CG^GCTCOT SISSSS CAGATCATCC TGATCGACAA 
GGCAGGTAGC CGGATCAAGC GTATGCArrr SSJS ° ATG TTTCGCTTGG TGGTCGAATG 
TCTCGGCAGG AGCAAGGTGA StgaSSa rSS^ ATCAGCCATG ATGGATACTT 
GCCAGTCCCT TCCCGCTTCA ct£SSS£ rrlrZZn^ GGGG ACTTCG CCCAATAGCA 
TGGCCAGCCA CGATAGCCGC GCTGC^cS CCTnrnrTTp TGCGCAAGGA ACGCCCGTCG 
CGGTCTTGAC AAAAAGAACC GGGCgScS Gcl^zZIr A JJ CAGGGCA CCGGACAGGT 
AGCAGCCGAT TGTCTGTTGT GCCCAGTcS ar^n *S 2 CCGGAACACG GCGGCATCAG 
GAGAACCTGC GTGCAATCGA TCTTGTTCAA S r™£ CCACC «MCOecCG 
GATCAGATCT TGATCCCPTP er^tiZZt? CATGCGAAA CGATCCTCAT CCTGTCTCTT 
CTTTGCAGGG 5?aCCA^GG ££S£SS 

TTACATAACT TACGGTAAAT GGCCCG^G SS^JJ?* GCTTGCATGC CTGCAGGTCG 
CGTCAATAAT GACGTATGT? CC^AgS^ ?r?r A S G S C OWOMCCC CGCCCATTGA 
GGGTGGAGTA TTTACGGTAA ACTGCCCACT SSSJI^f GACTTTCCAT TGACGTCAAT 
GTACGCCCCC TATTGACGTC AATGACGgS ISr^r^ GAAGTG TAT CATATGCCAA 
TGACCTTATG GGACTTTCCT ACTTGGC^g? a^ISS^ 0 CT «5CATTAT GCCCAGTACA 
TGGTGATGCG GTTTTGGCAG iSSSSS SJ2 f ATTAGTCATC GCTATTACCA 
TTCCAAGTCT CCACCCCATT GACGTCAATG C^rZ^ A GGG GTTTGAC TCACGGGGAT 
ACTTTCCAAA ATGTCGTAAC AACTCCGCCr ™Sirj GTT TTGGG ACCAA AATCAACGGG 
GGTGGGAGGT CTATATAAGC A^CTCgS SSSSSS? AA J GGGCGGT AGGG GTGTAC 
A.CCACGCTG TTTTGACCTC CATAGAAGAC SS Sg^gS 
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45 
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60 


AGCTTCGAGG 
TTGGCGTAAT 
CACAACATAC 
CTCACATTAA 
CTGCATTAAT 
GCTTCCTCGC 
CACTCAAAGG 
TGAGCAAAAG 
CATAGGCTCC 
AACCCGACAG 
CCTGTTCCGA 
GCGCTTTCTC 
CTGGGCTGTG 
CGT CTTGAGT 
AGGATTAGCA 
TACGG CT ACA 
GGAAAAAGAG 
TTTGTTTGCA 
TTTTCTACGG 
TGCGAATCGG 
AGCTCTTCAG 
AGCCGGCCAC 
CAGGCATCGC 
GCGAACAGTT 
AGACCGGCTT 
GGGCAGGTAG 
TTCTCGGCAG 
AGCCAGTCCC 
GTGGCCAGCC 
TCGGTCTTGA 
GAGCAGCCGA 
GGAGAACCTG 
TGATCAGATC 
ACTTTGCAGG 
GCTGTCCATA 
AGCTTGCATG 
CCAACGACCC 
GGACTTTCCA 
ATCAAGTGTA 
CCTGGCATTA 
TATTAGTCAT 
AGCGGTTTGA 
TTTGGCACCA 
• AAATGGGCGG 
GTCAGATCGC 
GATCCAGCCT 
AGGTAAGTGT 
ACTGCAGTAT 
GCCACGATGT 
CTCAGTTTGG 
AACCTGCTGA 
TCCTGCACTG 
AAGACCTGTT 
TCTTCCACAA 
TGCCTTGGTA 
GCAGCACTTC 
AT CGATGAGC 
GTGGGAGAAG 
AGCACCCGCG 


GGGGGCCCGG 
CATGGTCATA 
GAGCCGGAAG 
TTGCGTTGCG 
GAATCGGCCA 
TCACTGACTC 
CGGTAATACG 
GCCAGCAAAA 
GCCCCCCTGA 
GACTATAAAG 
CCCTGCCGCT 
ATAGCTCACG 
TGCACGAACC 
CCAACCCGGT 
GAGCGAGGTA 
CTAGAAGGAC 
TTGGTAGCTC 
AGCAGCAGAT 
GGTCTGACGC 
GAGCGGCGAT 
CAATATCACG 
AGTCGATGAA 
CATGCGTCAC 
CGGCTGGCGC 
CCATCCGAGT 
CCGGATCAAG 
GAGCAAGGTG 
TTCCCGCTTC 
ACGATAGCCG 
CAAAAAGAAC 
TTGTCTGTTG 
CGTGCAATCC 
TTGATCCCCT 
GCTTCCCAAC 
AAACCGCCCA 
CCTGCAGGTC 
CCGCCCATTG 
TTGACGTCAA 
TCATATGCCA 
TGCCCAGTAC 
CGCTATTACC 
CTCACGGGGA 
AAATCAACGG 
TAGGCGTGTA- 
CTGGAGACGC 
CCGCGGCCGG 
CTTCCTCCTG 
CTGTATTTTT 
GTCAATCACG 
CCAGGGTCAT 
AGACCACAGA 
CTGAAGACAT 
TACCACTGGA 
CAAGAGGGAG 
GCATCTATGA 
AGAATCACAA 
TGATGCAGTC 
CAGACCCTTA 
TCGTGACCAT 


TACCAGCTTT 
GCTGTTTCCT 
CATAAAGTGT 
CTCACTGCCC 
ACGCGCGGGG 
GCTGCGCTCG 
GTT AT CCACA 
GGCCAGGAAC 
CGAGCATCAC 
ATACCAGGCG 
TACCGGATAC 
CTGTAGGTAT 
CCCCGTTCAG 
AAGACACGAC 
TGTAGGCGGT 
AGTATTTGGT 
TTGATCCGGC 
TACGCGCAGA 
TCAGAAGAAC 
ACCGTAAAGC 
GGTAGCCAAC 
TCCAGAAAAG 
GACGAGATCC 
GAGCCCCTGA 
ACGTGCTCGC 
CGTATGCAGC 
AGATGACAGG 
AGTGACAACG 
CGCTGCCTCG 
CGGGCGCCCC 
TGCCCAGTCA 
ATCTTGTTCA 
GCGCCATCAG 
CTTACCAGAG 
GTCTAGCAAC 
GTTACATAAC 
ACGTCAATAA 
TGGGTGGAGT 
AGTACGCCCC 
ATGACCTTAT 
ATGGTGATGC 
TTTCCAAGTC 
GACTTTCCAA 
CGGTGGGAGG 
CATCCACGCT 
GAACGGTGCA 
TTTCCTTCCC 
GCTAGCAGTA 
CTACCTCCTC 
TCCAGTCTCT 
TGACATGGTG 
CGATCATGAA 
ACTACACAAG 
CTGCCTGCCC 
GGACTTGAAG 
CCAT CAGCAG 
TCTGAATCAT 
CAGAGTGAAA 
CAACAGGGTG 


TGTTCCCTTT 
GTGTGAAATT 
AAAGCCTGGG 
GCTTTCCAGT 
AGAGGCGGTT 
GTCGTTCGGC 
GAATCAGGGG 
CGTAAAAAGG 
AAAAATCGAC 
TTTCCCCCTG 
CTGTCCGCCT 
CTCAGTTCGG 
CCCGACCGCT 
TTATCGCCAC 
GCTACAGAGT 
ATCTGCGCTC 
AAACAAACCA 
AAAAAAGGAT 
TCGTCAAGAA 
ACGAGGAAGC 
GCTATGTCCT 
CGGCCATTTT 
TCGCCGTCGG 
TGCTCTTCGT 
TCGATGCGAT 
CGCCGCATTG 
AGATCCTGCC 
TCGAGCACAG 
TCCTGCAGTT 
TGCGCTGACA 
TAGCCGAATA 
ATCATGCGAA 
ATCCTTGGCG 
GGCGCCCCAG 
TGTTGGGAAG 
TTACGGTAAA 
TGACGTATGT 
ATTTACGGTA 
CTATTGACGT 
GGGACTTTCC 
GGTTTTGGCA 
TCCACCCCAT 
AATGTCGTAA 
TCTATATAAG 
GTTTTGACCT 
TTGGAACGCG 
CTGCTATTCT 
ATACTAACGG 
TTTTTGGCCA 
GGACCTGCCA 
AAGACGGCCA 
GACATCACAC 
AACGAGAGTT 
CCACAGAAGA 
ATGTACCAGA 
ATCATTCTAG 
AAT GGCGAGA 
ATGAAGCTCT 
ATGGGCTATC 


AGTGAGGGTT 
GTTATCCGCT 
GTGCCTAATG 
CGGGAAACCT 
TGCGTATTGG 
TGCGGCGAGC 
ATAACGCAGG 
CCGCGTTGCT 
GCTCAAGTCA 
GAAGCTCCCT 
TTCTCCCTTC 
TGTAGGTCGT 
GCGCCTTATC 
TGGCAGCAGC 
TCTTGAAGTG 
TGCTGAAGCC 
CCGCTGGTAG 
CTCAAGAAGA 
GGCGATAGAA 
GGTCAGCCCA 
GATAGCGGTC 
CCACCATGAT 
GCATGCGCGC 
CCAGATCATC 
GTTTCGCTTG 
CAT CAGCC AT 
CCGGCACTTC 
CTGCGCAAGG 
CATTCAGGGC 
GCCGGAACAC 
GCCTCTCCAC 
ACGATCCTCA 
GCAAGAAAGC 
CTGGCAATTC 
GGCGGGGCTG 
TGGCCCGCCT 
TCCCATAGTA 
AACTGCCCAC 
CAATGACGGT 
TACTTGGCAG 
GTACATCAAT 
TGACGTCAAT 
CAACTCCGCC 
CAGAGCTCGT 
CCATAGAAGA 
GATTCCCCGT 
GCTCAACCTT 
TTCTTTTTTT 
CCCTTGCCCT 
GGTGTCTTAG 
GAGAAAAACT 
GGGACCAAAC 
GCCTGGCTAC 
CGTCTTTGAT 
CAGAGTTCCA 
ACAAGGGCAT 
CTCTGCGCCA 
GCATCCTGCT 
TGAGCTCCGC 


AATTTCGAGC 
CACAATTCCA 
AGTGAGCTAA 
GTCGTGCCAG 
GCGCTCTTCC 
GGTATCAGCT 
AAAGAACATG 
GGCGTTTTTC 
GAGGTGGCGA 
CGTGCGCTCT 
GGGAAGCGTG 
TCGCTCCAAG 
CGGTAACTAT 
CACTGGTAAC 
GTGGCCTAAC 
AGTTACCTTC 
CGGTGGTTTT 
TCCTTTGATC 
GGCGATGCGC 
TTCGCCGCCA 
CGCCACACCC 
ATTCGGCAAG 
CTTGAGCCTG 
CTGATCGACA 
GTGGTCGAAT 
GATGGATACT 
GCCCAATAGC 
AACGCCCGTC 
ACCGGACAGG 
GGCGGCATCA 
CCAAGCGGCC 
TCCTGTCTCT 
CATCCAGTTT 
CGGTTCGCTT 
CAGGAATTCG 
GGCTGACCGC 
ACGCCAATAG 
TTGGCAGTAC 
AAATGGCCCG 
TACATCTACG 
GGGCGTGGAT 
GGGAGTTTGT 
CCATTGACGC 
TTAGTGAACC 
CACCGGGACC 
GTTAATTAAC 
CCTATCAGAA 
CTCTTCACAG 
CCTAAACCAC 
CCAGTCCCGA 
GAAACATTAT 
CAGCACATTG 
TAGAGAGACT 
GATGACCCTG 
GGCCATCAAC 
GCTGGTGGCC 
GAAACCTCCT 
TCACGCCTTC 
CTTAACTAGT 
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GAAAGCTCAA 
TAGAGCTCGC 
CTCCCCCGTG 
TGAGGAAATT 
5 GCAGGACAGC 
CTCTATGGCT 
GCAGGTCGTT 
CCCATTGACG 
ACGTCAATGG 
1 0 TATGCCAAGT 
CCAGTACATG 
TATTACCATG 
ACGGGGATTT 
TCAACGGGAC 
1 5 GCGTGTACGG 
GAGACGCCAT 
CGGCCGGGAA 
CCTCCTGTTT 
TATTTTTGCT 
2 0 CTCAGAAGCT 
TGTGGGAGCT 
GAGAAACAGT 
ACCAGAGACA 
TAGATGCTGG 

2 5 TGCTCCACAA 

AGACTTTCCT 
TGCAAAGAAA 
GGGCAGTGAC 
ACTATGAGAA 

3 0 CCCTGCCCAT 

CCAGCTTCTT 
CTTTGAAGAA 
ATTCCTACTT 
AGACAGAGGA 
J 5 TCCAATGCAA 
GCAGCAAGTG 
ACTGTGCCTT 
CTGGAAGGTG 
CTGAGTAGGT 
4 0 TGGGAAGACA 
AGAACCAGCT 
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<210> 
<211> 
<212> 
<400> 

CGTTACATAA 
GACGTCAATA 
ATGGGTGGAG 
AAGTACGCCC 
CATGACCTTA 
CATGGTGATG 
ATTTCCAAGT 
GGACTTTCCA 
ACGGTGGGAG 
CCATCCACGC 
GGAACGGTGC 
GTTTCCTTCC 
TGCTAGCAGT 
TTGCTTTACT 
ATCTGCCTCA 
GGAGAATCTC 
AGTTTGGCAA 


GGCCCTCTGC 
TGATCAGCCT 
CCTTCCTTGA 
GCATCGCATT 
AAGGGGGAGG 
TCTGAGGCGG 
ACATAACTTA 
TCAATAATGA 
GTGGAGTATT 
ACGCCCCCTA 
ACCTTATGGG 
GTGATGCGGT 
CCAAGTCTCC 
TTTCCAAAAT 
TGGGAGGTCT 
CCACGCTGTT 
CGGTGCATTG 
CCTTCCCCTG 
AGCAGTAATA 
AACCATCTCC 
GGAGAAAGAC 
GAACCTCACC 
TGGAGTCATA 
CCAGTACACC 
GAAGGAAAAT 
GAAGTGTGAA 
CATGGACTTG 
ATGTGGAATG 
GTATTCAGTG 
TGAACTGGCG 
CATCAGGGAC 
CTCACAGGTG 
CTCCCTCAAG 
GGGGTGTAAC 
AGGCGGGAAT 
GGCATGTGTT 
CTAGTTGCCA 
CCACTCCCAC 
GTCATTCTAT 
ATAGCAGGCA 
GGGGCTCGAG 

21 


3589 

nucleic acid 
21 


CACAGCGCCC 
CGACTGTGCC 
CCCTGGAAGG 
GTCTGAGTAG 
ATTGGGAAGA 
AAAGAACCAG 
CGGTAAATGG 
CGTATGTTCC 
TACGGTAAAC 
TTGACGTCAA 
ACTTTCCTAC 
TTTGGCAGTA 
ACCCCATTGA 
GTCGTAACAA 
ATATAAGCAG 
TTGACCTCCA 
GAACGCGGAT 
CTATTCTGCT 
CTAACGGTTC 
TGGTTTGCCA 
GTTTATGTTG 
TGTGACACGC 
GGCTCTGGAA 
TGCCACAAAG 
GGAATTTGGT 
GCACCAAATT 
AAGTTCAACA 
GCGTCTCTGT 
TCCTGCCAGG 
TTGGAAGCAC 
ATCATCAAAC 
GAGGTCAGCT 
TTCTTTGTTC 
CAGAAAGGTG 
GTCTGCGTGC 
CCCTGCAGGG 
GCCATCTGTT 
TGTCCTTTCC 
TCTGGGGGGT 
TGCTGGGGAT 
CATGCA 


TCCTCACACA 
TTCTAGTTGC 
TGCCACTCCC 
GTGTCATTCT 
CAATAGCAGG 
CTGGGGCTCG 
CCCGCCTGGC 
CATAGTAACG 
TGCCCACTTG 
TGACGGTAAA 
TTGGCAGTAC 
CATCAATGGG 
CGTCAATGGG 
CTCCGCCCCA 
AGCTCGTTTA 
TAGAAGACAC 
TCCCCGTGTT 
CAACCTTCCT 
TTTTTTTCTC 
TCGTTTTGCT 
TAGAGGTGGA 
CTGAAGAAGA 
AGACCCTGAC 
GAGGCGAGAC 
CCACTGAAAT 
ACTCCGGACG 
TCAAGAGCAG 
CTGCAGAGAA 
AGGATGTCAC 
GGCAGCAGAA 
CAGACCCGCC 
GGGAGTACCC 
GAATCCAGCG 
CGTTCCTCGT 
AAGCTCAGGA 
TCCGATCCTA 
GTTTGCCCCT 
TAATAAAATG 
GGGGTGGGGC 
GCGGTGGGCT 


GATAGGAATT 
CAGCCATCTG 
ACTGTCCTTT 
ATTCTGGGGG 
CATGCTGGGG 
AAATTCGAGC 
TGACCGCCCA 
CCAATAGGGA 
GCAGTACATC 
TGGCCCGCCT 
ATCTACGTAT 
CGTGGATAGC 
AGTTTGTTTT 
TTGACGCAAA 
GTGAACCGTC 
CGGGACCGAT 
AATTAACAGG 
ATCAGAAACT 
TTCACAGGCC 
GGTGTCTCCA 
CTGGACTCCC 
TGACATCACC 
CATCACTGTC 
TCTGAGCCAC 
TTTAAAAAAT 
GTTCACGTGC 
TAGCAGTTCC 
GGTCACACTG 
CTGCCCAACT 
TAAATATGAG 
CAAGAACTTG 
TGACTCCTGG 
CAAGAAAGAA 
AGAGAAGACA 
TCGCTATTAC 
GAGCTCGCTG 
CCCCCGTGCC 
AGGAAAT TGC 
AGGACAGCAA 
CTATGGCTTC 


CTGCAGATCC 
TTGTTTGCCC 
CCTAATAAAA 
GTGGGGTGGG 
ATGCGGTGGG 
TTGCATGCCT 
ACGACCCCCG 
CTTTCCATTG 
AAGTGTATCA 
GGCATTATGC 
TAGTCATCGC 
GGTTTGACTC 
GGCACCAAAA 
TGGGCGGTAG 
AGATCGCCTG 
CCAGCCTCCG 
TAAGTGTCTT 
GCAGTATCTG 
ACGATGTGTC 
CTCATGGCCA 
GATGCCCCTG 
TGGACCTCAG 
AAAGAGTTTC 
TCACATCTGC 
TTCAAAAACA 
TCATGGCTGG 
CCTGACTCTC 
GACCAAAGGG 
GCCGAGGAGA 
AACTACAGCA 
CAGATGAAGC 
AGCACTCCCC 
AAGATGAAGG 
TCTACCGAAG 
AATTCCTCAT 
ATCAGCCTCG 
TTCCTTGACC 
ATCGCATTGT 
GGGGGAGGAT 
TGAGGCGGAA 


CTTACGGTAA 
ATGACGTATG 
TATTTACGGT 
CCTATTGACG 
TGGGACTTTC 
CGGTTTTGGC 
CTCCACCCCA 
AAATGTCGTA 
GTCTATATAA 
TGTTTTGACC 
ATTGGAACGC 
CCTGCTATTC 
AATACTAACG 
GGTGGCCCTC 
AACCCACAGC 
TCTTTTCTCC 
CCAGTTCCAA 


ATGGCCCGCC 
TTCCCATAGT 
AAACTGCCCA 
TCAATGACGG 
CTACTTGGCA 
AGTACATCAA 
TTGACGTCAA 
ACAACTCCGC 
GCAGAGCTCG 
TCCATAGAAG 
GGATTCCCCG 
TGCTCAACCT 
GTTCTTTTTT 
CTGGTGCTCA 
CTGGGTAGCA 
TGCTTGAAGG 
AAGGCTGAAA 


TGGCTGACCG 
■AACGCCAATA 
CTTGGCAGTA 
TAAATGGCCC 
GTACATCTAC 
TGGGCGTGGA 
TGGGAGTTTG 
CCCATTGACG 
TTTAGTGAAC 
ACACCGGGAC 
TGTTAATTAA 
TCCTATCAGA 
TCTCTTCACA 
GCTGCAAGTC 
GGAGGACCTT 
ACAGACATGA 
CCATCCCTGT 


CCCAACGACC 
GGGACTTTCC 
CAT CAAGTGT 
GCCTGGCATT 
GTATTAGTCA 
TAGCGGTTTG 
TTTTGGCACC 
CAAATGGGCG 
CGTCAGATCG 
CGATCCAGCC 
CAGGTAAGTG 
AACTGCAGTA 
GGCCACCATG 
AAGCTGCTCT 
GATGCTCCTG 
CTTTGGATTT 
CCTCCATGAG 


CCCGCCCATT 
ATTGACGTCA 
ATCATATGCC 
ATGCCCAGTA 
TCGCTATTAC 
ACTCACGGGG 
AAAATCAACG 
GTAGGCGTGT 
CCTGGAGACG 
TCCGCGGCCG 
TCTTCCTCCT 
TCTGTATTTT 
GCCTTGACCT 
GTGGGCTGTG 
GCACAGATGA 
CCCCAGGAGG 
ATGATCCAGC 
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AGATCTTCAA 
ACAAATTCTA 
GGGTGGGGGT 
ACTTCCAAAG 
TTGTCAGAGC 
GAAGTAAGGA 
CCCCTCCCCA 
CCTAATAAAA 
GTGGAGGGGG 
GGGGGCCCGG 
CATGGTCATA 
GAGCCGGAAG 
TTGCGTTGCG 
GAATCGGCCA 
TCACTGACTC 
CGGTAATACG 
GCCAGCAAAA 
GCCCCCCTGA 
GACTATAAAG 
CCCTGCCGCT 
ATAGCTCACG 
TGCACGAACC 
CCAACCCGGT 
GAGCGAGGTA 
CTAGAAGGAC 
TTGGTAGCTC 
AGCAGCAGAT 
GGTCTGACGC 
GAGCGGCGAT 
CAATATCACG 
AGTCGATGAA 
CATGCGTCAC 
CGGCTGGCGC 
CCATCCGAGT 
CCGGATCAAG 
GAGCAAGGTG 
TTCCCGCTTC 
ACGATAGCCG 
CAAAAAGAAC 
TTGTCTGTTG 
CGTGCAATCC 
TTGATCCCCT 
GCTTCCCAAC 


TCTCTTCAGC 
C ACT GAACTC 
GACAGAGACT 
AATCACTCTC 
AGAAATCATG 
ATGAATCTAG 
GTGCCTCTCC 
TTAAGTTGCA 
GTGGTATGGA 
TACCAGCTTT 
GCTGTTTCCT 
CATAAAGTGT 
CTCACTGCCC 
ACGCGCGGGG 
GCTGCGCTCG 
GTTATCCACA 
GGCCAGGAAC 
CGAGCATCAC 
ATACCAGGCG 
TACCGGATAC 
CTGTAGGTAT 
CCCCGTTCAG 
AAGACACGAC 
TGTAGGCGGT 
AGTATTTGGT 
TTGATCCGGC 
TACGCGCAGA 
TCAGAAGAAC 
ACCGTAAAGC 
GGTAGCCAAC 
TCCAGAAAAG 
GACGAGATCC 
GAGCCCCTGA 
ACGTGCTCGC 
CGTATGCAGC 
AGATGACAGG 
AGTGACAACG 
CGCTGCCTCG 
CGGGCGCCCC 
TGCCCAGTCA 
ATCTTGTTCA 
GCGCCATCAG 
CTTACCAGAG 


ACAAAGGACT 
TACCAGCAGC 
CCCCTGATGA 
TATCTGAAAG 
AGAT CTTTTT 
AAAAGCCGAA 
TGGCCCTGGA 
TCATTTTGTC 
GCAAGGGGCA 
TGTTCCCTTT 
GTGTGAAATT 
AAAGCCTGGG 
GCTTTCCAGT 
AGAGGCGGTT 
GTCGTTCGGC 
GAATCAGGGG 
CGTAAAAAGG 
AAAAATCGAC 
TTTCCCCCTG 
CTGTCCGCCT 
CTCAGTTCGG 
CCCGACCGCT 
TTATCGCCAC 
GCTACAGAGT 
AT CTGCGCT C 
AAACAAACCA 
AAAAAAGGAT 
TCGTCAAGAA 
ACGAGGAAGC 
GCTATGTCCT 
CGGCCATTTT 
TCGCCGTCGG 
TGCTCTTCGT 
TCGATGCGAT 
CGCCGCATTG 
AGATCCTGCC 
TCGAGCACAG 
TCCTGCAGTT 
TGCGCTGACA 
TAGCCGAATA 
ATCATGCGAA 
AT CCTTGGCG 
GGCGAATTCG 


CATCTGCTGC 
TGAATGACCT 
AGGAGGACTC 
AGAAGAAATA 
CTTTGTCAAC 
TTCTGCAGGA 
AGTTGCCACT 
TGACTAGGTG 
AGTTGGGAAG 
AGTGAGGGTT 
GTTATCCGCT 
GTGCCTAATG 
CGGGAAACCT 
TGCGTATTGG 
TGCGGCGAGC 
ATAACGCAGG 
CCGCGTTGCT 
GCTCAAGTCA 
GAAGCTCCCT 
TTCTCCCTTC 
TGTAGGTCGT 
GCGCCTTATC 
TGGCAGCAGC 
TCTTGAAGTG 
TGCTGAAGCC 
CCGCTGGTAG 
CTCAAGAAGA 
GGCGATAGAA 
GGTCAGCCCA 
GATAGCGGTC 
CCACCATGAT 
GCATGCGCGC 
CCAGATCATC 
GTTTCGCTTG 
CATCAGCCAT 
CCGGCACTTC 
CTGCGCAAGG 
CAT T C AGGGC 
GCCGGAACAC 
GCCTCTCCAC 
ACGATCCTCA 
GCAAGAAAGC 
AGCTTGCATG 


TTGGGATGAG 
GGAAGCCTGT 
CATTCTGGCT 
CAGCCCTTGT 
AAACTTGCAA 
ATTGGGTGGC 
CCAGTGCCCA 
TCCTTCTATA 
ACAACCTGTA 
AATTTCGAGC 
CACAATTCCA 
AGTGAGCTAA 
GTCGTGCCAG 
GCGCTCTTCC 
GGTATCAGCT 
AAAGAACATG 
GGCGTTTTTC 
GAGGTGGCGA 
CGTGCGCTCT 
GGGAAGCGTG 
TCGCTCCAAG 
CGGTAACTAT 
CACTGGTAAC 
GTGGCCTAAC 
AGTTACCTTC 
CGGTGGTTTT 
TCCTTTGATC 
GGCGATGCGC 
TTCGCCGCCA 
CGCCACACCC 
ATTCGGCAAG 
CTTGAGCCTG 
CTGATCGACA 
GTGGTCGAAT 
GATGGATACT 
GCCCAATAGC 
AACGCCCGTC 
ACCGGACAGG 
GGCGGCATCA 
CCAAGCGGCC 
TCCTGTCTCT 
CATCCAGTTT 
CCTGCAGGT 


ACCCTCCTAG 
GTGATACAGG 
GTGAGGAAAT 
GCCTGGGAGG 
GAAAGTTTAA 
ATCCCTGTGA 
CCAGCCTTGT 
ATATTATGGG 
GGGCTCGAGG 
TTGGCGTAAT 
CACAACATAC 
CTCACATTAA 
CTGCATTAAT 
GCTTCCTCGC 
CACTCAAAGG 
TGAGCAAAAG 
CATAGGCTCC 
AACCCGACAG 
CCTGTTCCGA 
GCGCTTTCTC 
CTGGGCTGTG 
CGTCTTGAGT 
AGGATTAGCA 
TACGGCTACA 
GGAAAAAGAG 
TTTGTTTGCA 
TTTTCTACGG 
TGCGAATCGG 
AGCTCTTCAG 
AGCCGGCCAC 
CAGGCATCGC 
GCGAACAGTT 
AGACCGGCTT 
GGGCAGGTAG 
TTCTCGGCAG 
AGCCAGTCCC 
GTGGCCAGCC 
TCGGTCTTGA 
GAGCAGCCGA 
GGAGAACCTG 
TGATCAGATC 
ACTTTGCAGG 


1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3589 


45 


<210> 
<211> 
<212> 
<400> 


22 
567 

nucleic acid 
22 


50 


55 


60 


ATGGCCTTGA 
TCTGTGGGCT 
CTGGCACAGA 
TTTCCCCAGG 
GAGATGATCC 
GAGACCCTCC 
TGTGTGATAC 
GCTGTGAGGA 
TGTGCCTGGG 
CAAGAAAGTT 


CCTTTGCTTT 
GTGATCTGCC 
TGAGGAGAAT 
AGGAGTTTGG 
AGCAG AT CTT 
TAGACAAATT 
AGGGGGTGGG 
AATACTTCCA 
AGGTTGTCAG 
TAAGAAGTAA 


ACTGGTGGCC 
TCAAACCCAC 
CTCTCTTTTC 
CAACCAGTTC 
CAATCTCTTC 
CTACACTGAA 
GGTGACAGAG 
AAGAATCACT 
AG CAGAAATC 
GGAATGA 


CTCCTGGTGC 
AGCCTGGGTA 
TCCTGCTTGA 
CAAAAGGCTG 
AGCACAAAGG 
CTCTACCAGC 
ACTCCCCTGA 
CTCTATCTGA 
ATGAGATCTT 


TCAGCTGCAA 
GCAGGAGGAC 
AGGACAGACA 
AAACCATCCC 
ACTCATCTGC 
AGCTGAATGA 
TGAAGGAGGA 
AAGAGAAGAA 
TTTCTTTGTC 


GTCAAGCTGC 
CTTGATGCTC 
TGACTTTGGA 
TGTCCTCCAT 
TGCTTGGGAT 
CCTGGAAGCC 
CTCCATTCTG 
ATACAGCCCT 
AACAAACTTG 


60 
120 
180 
240 
300 
360 
420 
480 
540 
567 
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<210> 23 
<211> 16 
<212> nucleic acid 

5 <1oO> 23" StandS C " T '' StandS b3Se - 

YYYYYYYYYY YNYAGG 

<210> 24 
10 <211> 660 

<212> nucleic acid 

<223> "Y" stands for C or T; »R» stands for A or G; 

"W" stands for T or A; »S" stands- for C or G; «N« 
stands for any base. 
15 <400> 24 

™£S SSSSSS ESSES SSSSS ?™™ GA ™ TNWSN 60 

sssss ssssss Eii iE? sssss ssss as 

GCNYTNTGYY TM.S5SS? JESSES SSiEST! "«»«« "<» 

ATGAAYGCNA ARYTNYTMAT GGAYCfTlAAP m™^™ AYCAR =™GA RTTYAARACN 420 

GCNGTNATHG AYGARYTNAT GCARGCNYTH l^S^I" 7 TYYTNG A*CA RAAYATGYTN 480 

WSNWSNYTNG JUwSsSS ACNAARATH& ESS* 0 * 1, NCCN ^ARAAR 54 0 

GCNTTYMGNA THMGNGCNGT N AcS£y ^SSSSS S2 


16 


25 


30 <210> 25 
<211> 987 
<212> nucleic acid 


35 


40 


<223> 


Y stands for C or T; "R« stands for A or G; 
stands for T or A; »S" stands for C or G; «N" 
stands for any base. 


<400> 25 


SSS SSSSSSS T ~ ™^™GC NWSNCCNYTN 

GCNCCNGGNG ARAlS £S2£5 ™ GAYTG ^CCNGAY 

ACNYTNGAYC ARWSNWSNGA RGTNYTN^rM S^m^* G ^^GAYGG NATHACNTGG 
GARTTYGGNG AYGCNGGNCA MASSES? ?f SSS^* ^^NACNAT HCARGTNAAR 
YTNYTNYTNY TNCAYAARAA ISSUES ^u^SS ° GNGAR GTNYT NWSNCAYWSN 
„ c AARGARCCNA ARAAYAARAC JJST CNGAYATHYT NAARGAYCAR 

4 5 ACNTGYTGGT GGYTNACNAC SSSSSS SJSSfSS * MAmWS NGGNMGNTTY 
GGWWSNWSNG AYCCNCARGG SISS SSKSSI ^ SNG ™^ RWSNWSNMGN 
MGNGGNGAYA AYAARGARTA CNYTNWSNGC NGARMGNGTN 

GCNGCNGARG ARWSNYTNCC nShgAR^M SSSJ*^ ARGARGAYWS NGCNTGYCCN 
cn GARAAYTAYA CNHSNWS^T ?I5S52™ G CWSrmc ^^ RYTNAARTAY 

50 YTNCARYTNA ARCCNYTNAA rIIwsS SSSSIS* "^^VCC NCCNAARAAY 

acntggws na ccc^Ts SEEKS SKSS5 SKSK ESSES 


60 
120 
180 
240 
300 
360 
420 
4 80 
540 
600 
660 
720 
780 
640 
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WSNAARMGNG ARAARAARGA YMGNGTNTTY ACNGAYAARA CNWSNGCNAC NGTNATHTGY 900 
MGNAARAAYG CNWSNATHWS NGTNMGNGCN CARGAYMGNT AYTAYWSNWS NWSNTGGWSN 960 
GARTGGGCNW SNGTNCCNTG YWSNTRR 987 
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DESCRIPTION 

interferon Alpha Piasmids And De livery Systems, 
And Methods Of Making And Usi ng The Same 

Related Applications 
5 This application relates to U.S. patent application 

Serial No. 08/949,160, filed October 10, 1997 and 
International patent application No. PCT/US97 /1877 9 , filed 
October 10, 1997, (Lyon & Lyon Docket Nos . 226/285 US and 
PCT, respectively), both of which are related to U.S. patent 
10 application Serial No .. 60/028 , 67 6, filed October 18, 1996, 
(Lyon & Lyon Docket No. 222/086 US), all three of which are 
entitled "IL-12 GENE EXPRESSION AND DELIVERY SYSTEMS AND 
USES'' (by Nordstrom et al . ) . 

This application is also related to U.S. patent 
15 application Serial No. 08/798,974, filed February 11, 1997, 
(Lyon & Lyon Docket No. 224/084 US) and International patent 
application No. PCT /US95 / 17 038 , filed December 28, 1995, 
(Lyon & Lyon Docket No. 210/190 PCT), both of which are 
related to U.S. patent application Serial No. 08/372,213, 
20 filed January 13, 1995, (Lyon & Lyon Docket. No. 210/190 US), 
all three of which are entitled "FORMULATED NUCLEIC ACID 
COMPOSITIONS AND METHODS OF ADMINISTERING THE SAME FOR GENE 
THERAPY" (by Mumper Rolland) . 

Each of the above-mentioned applications are 
25 incorporated herein by reference in their entirety, 
including any drawings . 

Field Of The Invention 

The present invention relates to gene delivery and gene 
therapy, and provides novel nucleic acid constructs for 
30 expression of interferon alpha in a mammal, formulations for 
delivery that incorporate a nucleic acid construct for 
expression, and methods for preparing and using such 
constructs and formulations. In particular, this invention 
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relates to plasmid constructs for delivery of therapeutic 
interferon alpha encoding nucleic acids to cells in order to 
modulate tumor activity, methods of using those constructs 
(including combination therapy with other agents, such as 
cytokines, preferably IL-12) , as well as methods for 
preparing such constructs. 

Background Of The Invention 

The following discussion of the background of the 
invention is merely provided to aid the reader in 
understanding the invention and is not admitted to describe 
or constitute prior art to the present invention. 

Plasmids are an important element in genetic engi- 
neering and gene therapy. Plasmids are usually circular DNA 
molecules that can be introduced into bacterial cells by 
15 transformation which replicate autonomously in the cell. 
Plasmids typically allow for 'the amplification of cloned 
DNA. Some plasmids are present in 20 to 50 copies during 
cell growth, and after the arrest of protein synthesis, as 
many as 1000 copies per cell of a plasmid can be generated. 
20 Suzuki et al . , Genetic Analysis, p. 404 , 1989. 

Current non-viral approaches to human gene therapy 
require that a potential therapeutic gene be cloned into 
plasmids. Large quantities of a bacterial host harboring 
the plasmid may be fermented and the plasmid DNA may be 
25 purified for subsequent use. Current human clinical trials 
using plasmids utilize this approach. Recombinant DNA 
Advisory Committee Data Management Report, December, 1994, 
Human Gene Therapy 6:535-548. Studies normally focus on the 
therapeutic gene and the elements that control its 
30 expression in the patient when designing and constructing 
gene therapy plasmids. Generally, therapeutic genes and 
regulatory elements are simply inserted into existing 
cloning vectors that are convenient and readily available. 

Plasmid design and construction utilizes several key 
35 factors. First, plasmid replication origins determine 
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plasmid copy number, which affects production yields. 

Plasmids that replicate to higher copy number can increase 

plasmid yield from a given volume of culture, but excessive 

copy number can be deleterious to the bacteria and lead to 
5 undesirable effects (Fitzwater, et all, Embo J. 7:3289-3297 
(1988); Uhlin, et al., Mol. Gen. Genet. 165:167-179 (1979)). 

Artificially constructed plasmids are sometimes unstably . 

maintained, leading to accumulation of plasmid-free cells 

and reduced production yields. 
10 To overcome this problem of plasmid-free cells, genes 

that code for antibiotic resistance phenotype are included 
on the plasmid and antibiotics are added to kill or inhibit 
plasmid-free cells. Most general purpose cloning vectors 
contain ampicillin resistance (p-lactamase, or bla) genes. 
15 Use of ampicillin can be problematic because of the 
potential for residual antibiotic in the purified DNA, which 
can cause an allergic reaction in a treated patient. In 
addition, p-lactam antibiotics are clinically important .for 
disease treatment. When ' plasmids containing antibiotic 
20 resistance genes are used, the potential exists for the 
transfer of antibiotic resistance genes to a potential 
pathogen. 

Other studies have used the neo gene which is derived 
from the bacterial transposon Tnb . The neo gene encodes 
25 resistance to kanamycin and neomycin (Smith, Vaccine 
12:1515-1519 (1994)). This gene has been used in a number 
of gene therapy studies, including several human clinical 
trials (Recombinant DNA Advisory Committee Data Management 
Report, December, 1994, Human Gene Therapy 6:535-548). Due 
30 to the mechanism by which resistance is imparted, residual 
antibiotic and transmission of the gene to potential 
pathogens may be less of a problem than for (i-lactams. 

In addition to elements that affect the behavior of the 
plasmid within the host bacteria, such as E. col J., plasmid 
35 vectors have also been shown to affect transfection and 
expression in eukaryotic cells. Certain plasmid sequences 
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have been shown to reduce expression of eukaryotic genes in 
eukaryotic cells when carried in cis (Peterson, et al., Mol 
Cell. Biol. 7:1563-1567 (1987); Yoder et al., Mol . Cell 
Biol. 3:956-959 (1983); Lusky et al . , Nature 293:79-81 
5 (1981); and Leite, et al . , Gene 82:351-356 (1989)). Plasmid 
sequences also have been shown to fortuitously contain 
binding sites for transcriptional control proteins (Ghersa, 
et al.. Gene 151:331-332 (1994); Tully et al., Biochem. 
Biophys. Res, Comm. 144:1-10 (1987); and Kushner, et al., 
10 Mol. Endocrinol. 8:405-407 (1994)). This can cause inappro- 
priate levels of gene expression in treated patients. 

Interferon alpha is a gene product that has been 
proposed for use, either alone or in combination with other 
agents, in different delivery systems for the treatment of 
15 certain diseases, including particular cancers. 

International patent publication WO/97/00085, published 
January 3, 1997, proposes ex vivo transfection of tumor 
cells with interferon alpha and another immomodulatory 
molecule, such as IL-12. None of the previously proposed 
treatments have proven entirely satisfactory, due in part to 
the high cost and technical difficulty involved in ex vivo 
approaches. Thus there still remains a need in the art for 
improved plasmids encoding interferon alpha as well 
improved treatment protocols and technologies. 


20 


as 


25 Summary 

The present invention relates to gene delivery and gene 
therapy, and provides novel nucleic acid constructs for 
expression of interferon alpha in a mammal, formulations for 
delivery that incorporate a nucleic acid construct for 
30 expression, and methods for preparing and using such 
constructs and formulations. In particular, this invention 
relates to plasmid constructs for delivery of therapeutic 
interferon alpha encoding nucleic acids to cells in order to 
modulate tumor activity, methods of using those constructs 
(including combination therapy with other agents, such as 


35 
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cytokines, preferably IL-12) , as well as methods for 
preparing such constructs. The pharmaceutical acceptable, 
cost effective and highly efficient delivery system 
presented herein represents an unanticipated improvement 
5 over the art. 

Thus, in a first aspect, the invention features a 
plasmid that contains a CMV promoter and optionally a 
synthetic 5' intron transcriptionally linked with an 
interferon alpha coding sequence, and a 3 ' -untranslated 
10 region (UTR) . Preferably the 3' UTR is a 3 1 growth hormone 
UTR. 

As used herein, the term "plasmid" refers to a 
construct made up of genetic material (i.e., nucleic acids). 
It includes genetic elements arranged such that an inserted 

15 coding sequence can be transcribed in eukaryotic cells. 
Also, while the plasmid may include a sequence from a viral 
nucleic acid, such viral sequence does not cause the 
incorporation of the plasmid into a viral particle, and the 
plasmid is therefore a non-viral vector. Preferably a 

20 plasmid is a closed circular DNA molecule. 

"Cytomegalovirus promoter" refers to one or more 
sequences from a cytomegalovirus which are functional in 
eukaryotic cells as a transcriptional promoter and an 
upstream enhancer sequence. The enhancer sequence allows 

25 transcription to occur at a higher frequency from the 
associated promoter. 

In this context, "transcriptionally linked" means that 
in a system suitable for transcription, transcription will 
initiate under the direction of the control sequence (s) and 

30 proceed through sequences which are transcriptionally linked 
with that control sequence (s). Preferably no mutation is 
created in the resulting transcript, which would alter the 
resulting translation product. 

The term "coding region" or "coding sequence" refers to 

35 a nucleic acid sequence which encodes a particular gene 
product for which expression is desired, according to the 
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normal base pairing and codon usage relationships. Thus, 
the coding sequence must be placed in such relationship to 
transcriptional control sequences (possibly including 
control elements and translat ional initiation and 
5 termination codons) that a proper length transcript will be 
produced and will result in translation in the appropriate 
reading frame to produce a functional desired product. 

In a preferred embodiment the interferon alpha coding 
sequence is for human interferon alpha and preferably is a 

10 synthetic sequence having optimal codon usage, such as the 
nucleotide sequence of SEQ ID NO: 11 or semi-optimal codon 
usage, such as the nucleotide sequence of SEQ ID NO: 12. 

A particular example of coding regions suitable for use 
in the plasmids of this invention are the natural sequences 

15 coding for human interferon alpha. Thus, in a preferred 
embodiment coding region has a nucleotide sequence which is 
the same as SEQ ID NO: 10, which is the natural nucleotide 
sequence encoding human interferon alpha. However, it may 
be preferable to have an interferon alpha coding sequence 

20 which is a synthetic coding sequence. In a preferred 
embodiment, the interferon alpha coding sequence is a 
synthetic sequence utilizing optimal or semi-optimal codon 
usage, preferably the sequence shown in SEQ ID NO: 11 or SEQ 
ID NO:12. 

25 Thus, a "sequence coding for the human interferon 

alpha" or "a human interferon alpha coding sequence" is a 
nucleic acid sequence which encodes the amino acid sequence 
of human interferon alpha, based on the normal base pairing 
and translational codon usage relationships. It is 

30 preferable that the coding sequence encode the exact, full 
amino acid sequence of natural human interferon, but this is 
not essential. The encoded polypeptide may differ from 
natural human interferon alpha, so long as the polypeptide 
retains interferon alpha activity, preferably the 

35 polypeptide is at least 50%, 75%, 90%, or 97% as active as 
natural human interferon alpha, and more preferably fully as 
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active as natural human interferon alpha. Thus, the 

polypeptide encoded by the interferon alpha coding sequence 
may differ from a natural human interferon alpha polypeptide 
by a small amount, preferably less than a 15%, 10%, 5%, or 

5 1% change. Such a change may be of one of more different 
types, such as deletion, addition, or substitution of one or 
more amino acids. 

The term "transcriptional control sequence" refers to 
sequences which control the rate of transcription of a 
10 transcriptionally linked coding region. Thus, the term can 
include elements such as promoters, operators, and 
enhancers. For a particular transcription unit, the 

transcriptional control sequences will include at least a 
promoter sequence. 

15 a "growth hormone 3' untranslated region" is a sequence 

located downstream (i.e., 3') of the region encoding 
material polypeptide and including at least part of the 
sequence of the natural 3' UTR/poly(a) signal from a growth 
hormone gene, preferably the human growth hormone gene. 

20 This region is generally transcribed but not translated. 
For expression in eukaryotic cells it is generally 
preferable to include sequence which signals -the addition of 
a poly-A tail. As with other synthetic genetic elements a 
synthetic 3* UTR/poly (A) signal has a sequence which differs 

25 from naturally-occurring UTR elements. 

The sequence may be modified, for example by the 
deletion of ALU repeat sequences. Deletion of such ALU 
repeat sequences acts to reduce the possibility of 
homologous recombination between the expression cassette and 

30 genomic material in a transfected cell. 

The plasmid preferably includes a promoter, a TATA box, 
a Cap site and a first intron and intron/exon boundary in 
appropriate relationship for expression of the coding 
sequence. The plasmid may also include a 5' mRNA leader 

35 sequence inserted between the promoter and the coding 
sequence and/or an intron/5' UTR from a chicken skeletal a- 
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actin gene. Also, the plasmid may have a nucleotide 
sequence which is the same as the nucleotide sequence of 
plasmid pIF0921, as shown in Figure 5. 

The plasmid may also include: (a) a first transcription 
5 unit containing a first transcriptional control sequence 
transcriptionally linked with a first 5 - -untranslated 
region, a first intron, a first coding sequence, and a first 
3 '-untranslated region/poly (A) signal, wherein the first 
intron is between the control sequence and the first coding 
10 sequence; and (b) a second transcription unit containing a 
second transcriptional control sequence transcriptionally 
linked with a second 5 ' -untranslated region, a second 
intron, a second coding sequence, and a second 3'- 
untranslated region/poly (A) signal, wherein the second 
15 intron is between the control sequence and the second coding 
sequence; wherein the first and second coding sequences 
contain a sequence having the sequence of SEQ ID NO:2, 3, 4 
or 25 coding for a human IL-12 p4 0 subunit, and a sequence 
having the sequence of SEQ ID NO: 6, 7, 8 or 24 coding for a 
human IL-12 p35 subunit. 

The term "transcription unit" or "expression cassette" 
refers to a nucleotide sequence which contains at least one 
coding sequence along with sequence elements which direct 
the initiation and termination of transcription. a 
transcription unit may however include additional sequences, 
which may include sequences involved in post-transcriptional 
or post-translational processes. In preferred embodiments, 
the first transcriptional control sequence or the second 
transcriptional control sequence contain one or more 
30 cytomegalovirus promoter sequences. The first and second 
transcriptional control sequences can be the same or 
different . 

A w 5' untranslated region" or "5' UTR" refers to a 
sequence located 3' to promoter region and 5' of the 
35 downstream coding region. Thus, such a sequence, while 
transcribed, is upstream of the translation initiation codon 


20 


25 
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and therefore is not translated into a portion of the 
polypeptide product. 

For the plasmids described herein, one or more of a 
promoter, 5' untranslated region (5* UTR) , the 3* 
5 UTR/poly (A) signal, and introns may be a synthetic sequence . 
In this context the term "synthetic" means that the sequence 
is not provided directly by the sequence of a naturally 
occurring genetic element of that type but rather is an 
artificially created sequence {i.e., created by a person by 

10 molecular biological methods) . While one or more portions 
of such a synthetic sequence may be the same as portions of 
naturally occurring sequences, the full sequence over the 
specified genetic element is different from, a naturally 
occurring genetic element of that type. The use of such 

15 synthetic genetic elements allows the functional 
characteristics of that element to be appropriately designed 
for the desired function- 
Thus, a "synthetic intron" refers to a sequence which 
is not a naturally occurring intron sequence but which will 

20 be removed from an RNA transcript during normal post 
transcriptional processing. Such introns can be designed to 
have a variety of different characteristics, in particular 
such introns can be designed to have a desired strength of 
splice site. 

25 A "subunit" of a therapeutic molecule is a polypeptide 

or RNA molecule which combines with one or more other 
molecules to form a complex having the relevant 
pharmacologic activity. Examples of such complexes include 
homodimers and heterodimers as well as complexes having 

30 greater numbers of subunits. A specific example of a 
heterodimer is IL-12, having the p40 and p35 subunits. 

The "p40 subunit" is the larger of the two subunits of 
the IL-12 heterodimer. Thus, it is capable of association 
with p35 subunit to form a molecule having activities 

35 characteristic of IL-12. Human p40 has the amino acid 
sequence of SEQ ID NO:l. Those skilled in the art will 
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35 


recognize that the molecule may have a number of changes 
from that sequence, such as deletions, insertions or changes 
of one or a few amino acids, while still retaining IL-12 
activity when associated with p35. Such active altered 
molecules are also regarded as p40. 

Conversely, the "p35 subunit" is the smaller of the two 
heterodimeric subunits of IL-12. For humans, p35 has the 
amino acid sequence of SEQ ID NO: 5. As for p40, p35 may 
have a low level of alterations from that sequence while 
still being regarded as p35. 

A particular example of coding regions suitable for use 
in the plasmids of this invention are the"" natural sequences 
coding for the p40 and p35 subunits of human IL-12. Thus, 
in a preferred embodiment the first and second coding 
regions are coding regions for those sequences and are 
preferably in the order p40 then p35 in the 5* to 3' 
direction. 

Thus, a "sequence coding for the p40 subunit of human 
IL-12" is a nucleic acid sequence which encodes the human 
20 p40 subunit as described above, based on the normal base 
pairing and translational codon usage relationships. The 
sequence coding for p35 subunit of human IL-12 is similarly 
defined. 

In a preferred embodiment the sequence coding for the 
25 P 40 subunit of human IL-12 is 5' to the sequence coding for 
the p35 subunit of human IL-12. Those skilled in the art 
will appreciate that the interferon alpha, p35 subunit and 
p40 subunit may all be on a single transcription unit, that 
all three may be on separate transcription units, or that 
30 any two coding sequences may be on one transcription unit 
and the other coding sequence on another transcription unit. 

The plasmid may also contain an intron having variable 
splicing, a first coding sequence, and a second coding 
sequence, wherein the first and second coding sequences 
include a sequence having the sequence of SEQ ID NO: 2, 3, 4 
or 25 coding for a human IL-12 p40 subunit, and a sequence 
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having the sequence of SEQ ID NO: 6, 7, 8 or 24 coding for a 
human IL-12 p35 subunit. 

In a preferred embodiment, the plasmid also has: (a) a 
transcriptional control sequence transcriptionally linked 
5 with a first coding sequence and a second coding sequence; 
(b) a 5' -untranslated region; (c) an intron 5' to the first 
coding sequence; (d) an alternative splice site 3* to the 
first coding sequence and 5 1 to the second coding sequence; 
and (e) a 3 ' -untranslated region/poly (A) signal. The 

10 transcriptional control sequence preferably includes a 
cytomegalovirus promoter sequence. 

In a preferred embodiment, the plasmid also has: (a) a 
transcriptional control sequence transcriptionally linked 
with a first coding sequence, an IRES sequence, a second 

15 coding sequence, and a 3 * -untranslated region/poly (A) 
signal, wherein the IRES sequence is between the first 
coding sequence and the second coding sequence; and (b) an 
intron between the promoter and the first coding sequence; 
wherein the first and second coding sequences include a 

20 sequence having the sequence of SEQ ID NO: 2, 3, 4 or 25 
coding for a human IL-12 p40 subunit, and a sequence having 
the sequence of SEQ ID NO: 6, 7, 8 or 24 coding for a human 
IL-12 p35 subunit. The transcriptional control sequence 
preferably includes a cytomegalovirus promoter sequence and 

25 the IRES sequence preferably is from an encephalomyocarditis 
virus . 

For delivery of coding sequences for gene expression, 
it is generally useful to provide a delivery composition or 
delivery system which includes one or more other components 

30 in addition to the nucleic acid sequences. Such a 

composition can, for example, aid in maintaining the 
integrity of the DNA and/or in enhancing cellular uptake of 
the DNA and/or by acting as an immunogenic enhancer, such as 
by the non-DNA components having an immuno- stimulatory 

35 effect of their own. 
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Thus, in another aspect, the invention features a 
composition containing a plasmid as described above and a 
protective, interactive non-condensing compound (PINC) . 

The PINC enhances the delivery of the nucleic acid 
molecule to mammalian cells m vivo, and preferably the 
nucleic acid molecule includes a coding sequence for a gene 
product to be expressed in the cell. In many cases, the 
relevant gene product is a polypeptide or protein 
Preferably the PINC is used under conditions so that the 
PINC does not form a gel, or so that no gel form is present 
at the time of administration at about 30-40DC. Thus in 
these compositions, the P!NC is present at a concentration 
of 30% (w/v, or less. In certain preferred embodiments, the 

15 C ° nCentrati0n ±S Sti11 *>r example, 20% or less, 

15 10. or less, 5% or less, or 1% or l ess . Thus , tnese 
compositions differ in compound concentration and functional 
effect from uses of these or similar compounds in which the 
compounds are used at higher concentrations, for example in 
the ethylene glycol mediated transf ection of plant 
protoplasts, or the formation of gels for drug or nucleic 
add delivery. I n general, the PINCs are not in gel form in 
the conditions in which they are used as PINCs, though 
certam of the compounds may form gels under some 
conditions. 

25 In connection with the compounds and compositions of 

this invention, the term "protects" or "protective" refers 
to an effect of the interaction between such a compound and 
a nucleic acid such that the rate of degradation of the 
nuclei acid is decreased in a particular environment. Such 
degradation may be due to a variety of different factors, 
which specifically include the enzymatic action of a 
nuclease. The protective action may be provided in 
different ways, for example, by exclusion of the nuclease 
molecules or by exclusion of water. 

35 some compounds which protect a nucleic acid and/or 

prolong the bioavailability of a nucleic acid may also 
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interact or associate with the nucleic acid by 
intermolecular forces and/or valence bonds such as: Van der 
Waals forces, ion-dipole interactions, ion-induced dipole 
interactions, hydrogen bonds, or ionic bonds. These 
5 interactions may serve the following functions: (1) 
Stereoselectively protect nucleic acids from nucleases by 
shielding; (2) facilitate the cellular uptake of nucleic 
acid by "piggyback endocytosis". Piggyback endocytosis is 
the cellular uptake of a drug or other molecule complexed to 
10 a carrier that may be taken up by endocytosis. CV Uglea and 
C Dumitriu-Medvichi, Medical Applications of Synthetic 
Oligomers, In: Polymeric Biomaterials , 'Severian Dumitriu 
ed., Marcel Dekker, Inc., 1993, incorporated herein by 
reference. 

15 to achieve the desired effects set forth it is 

desirable, but not necessary, that the compounds which 
protect the nucleic acid and/or prolong the bioavailability 
of a nucleic acid have amphiphilic properties; that is, .have 
both hydrophilic and hydrophobic regions. The hydrophilic 

20 region of the compounds may associate with the largely ionic 
and hydrophilic regions of the nucleic acid, while the 
hydrophobic region of the compounds may act to retard 
diffusion of nucleic acid and to protect nucleic acid from 
nucleases . 

25 Additionally, the hydrophobic region may specifically 

interact with cell membranes, possibly facilitating 
endocytosis of the compound and thereby also of nucleic acid 
associated with the compound. This process may increase the 
pericellular concentration of nucleic acid. 

30 Agents which may have amphiphilic properties and are 

generally regarded as being pharmaceutically acceptable are 
the following : polyvinylpyrrolidones ; polyvinylalcohols ; 
polyvinylacetates; propylene glycol; polyethylene glycols; 
poloxamers (Pluronics) ; poloxamines (Tetronics) ; ethylene 

35 vinyl acetates; methylcelluloses, hydroxypropylcelluloses, 
hydroxypropylmethylcelluloses; heteropolysaccharides 
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(pectins); chitosans; phosphatidylcholines (lecithins) ; 
miglyols; polylactic acid; polyhydroxybutyric acid; xanthan 
gum. Also, copolymer systems such as polyethylene glycol- 
polylactic acid (PEG-PLA) , polyethylene glycol- 

5 polyhydroxybutyric acid (PEG-PHB), polyvinylpyrrolidone- 
polyvinylalcohol (PVP-PVA), and derivatized copolymers such 
as copolymers of N-vinyl purine (or pyrimidine) derivatives 
and N-vinylpyrrolidone. However, not all of the above 
compounds are protective, interactive, non-condensing 
10 compounds as described below. 

In connection with the protective, interactive, non- 
condensing compounds for these compositions, the term "non- 
condensing" means that an associated nucleic acid is not 
condensed or collapsed by the interaction with the PINC at 
15 the concentrations used in the compositions. Thus, the 
PINCs differ in type and/or use concentration from' such 
condensing polymers. Examples of commonly used condensing 
polymers include polylysine, and cascade polymers (spherical 
polycations) . 

20 Also in connection with such compounds and an 

associated nucleic acid molecule, the term "enhances the 
delivery" means that at least in conditions such that the 
amounts of PINC and nucleic acid is optimized, a greater 
biological effect is obtained than with the delivery of 
nucleic acid in saline. Thus, in cases where the expression 
of a gene product encoded by the nucleic acid is desired, 
the level of expression obtained with the PINC: nucleic acid 
composition is greater than the expression obtained with the 
same quantity of nucleic acid in saline for delivery by a 
method appropriate for the particular PINC/coding sequence 
combination. 

In preferred embodiments of the above compositions, the 
PINC is polyvinyl pyrrolidone (PVP) , polyvinyl alcohol 
(PVA), a PVP-PVA co-polymer, N-methyl-2-pyrrolidone (NM2P) , 
35 ethylene glycol, or propylene glycol. In compositions in 
which a Poloxamer (Pluronics) is used, the nucleic acid is 
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preferably not a viral vector, i.e., the nucleic acid is a 
non-viral vector. 

In other preferred embodiments, the PINC is bound with 
a targeting ligand. Such targeting ligands can be of a 
5 variety of different types, including but not limited to 
galactosyl, residues, fucosal residues, mannosyl residues, 
carntitine derivatives, monoclonal antibodies, polyclonal 
antibodies, peptide ligands, and DNA-binding proteins. The 
targeting ligands may bind with receptors on cells such as 
10 antigen-presenting cells, hepatocytes, myocytes, epithelial 
cells, endothelial cells, and cancer .cells . 

In connection with the association of a targeting 
ligand and a PINC, the term "bound with" means that the 
parts have an interaction with each other such that the 
15 physical association is thermodynamically favored, 
representing at least a local minimum in the free energy 
function for that association. Such interaction may involve 
covalent binding, or non-covalent interactions such as 
ionic, hydrogen bonding, van der Waals interactions, 
20 hydrophobic interactions, and combinations of such 
interactions . 

While the targeting ligand may be of various types, in 
one embodiment the ligand is an antibody. Both monoclonal 
antibodies and polyclonal antibodies may be utilized. 

25 The nucleic acid may also be present in various forms. 

Preferably the nucleic acid is not associated with a 
compounds (s) which alter the physical form, however, in 
other embodiments the nucleic acid is condensed (such as 
with a condensing polymer), formulated with cationic lipids, 

30 formulated with peptides, or formulated with cationic 
polymers . 

In preferred embodiments, the protective, interactive 
non-condensing compound is polyvinyl pyrrolidone, and/or the 
plasmid is in a solution having between 0.5% and 50% PVP, 
35 more preferably about 5% PVP. The DNA preferably is at 
least about 80% supercoiled, more preferably at least about 
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90% supercoiled, and most preferably at least about 95% 
supercoiled. 

In another aspect the invention features a composition 
containing a protective, interactive non-condensing compound 
and a plasmid containing an interferon alpha coding 
sequence. 

In yet another aspect, the invention provides a 
composition containing a plasmid of the invention (or a 
plasmid containing an interferon alpha coding sequence) and 
a cationic lipid with a neutral co-lipid. 

Preferably the cationic lipid is DOTMA and the neutral 
co-lipid is cholesterol (chol) . DOTMA is 1,2-di-o- 

octadecenyl-3-trimethylammonium propane, which is described 
and discussed in Eppstein et al . , U.S. Patent 4,897 355 
15 issued January 30, 1990, which is incorporated herein by 
reference. However, other lipids and lipid combinations may 
be used in other embodiments. A variety of such lipids are 
described in Gao & Huang, 1995, Gene Therapy 2:710-722, 
which is hereby incorporated by reference. 

As the charge ratio of the cationic lipid and the DNA 
is also a significant factor, in preferred embodiments the 
DNA and the cationic lipid are present is. such amounts that 
the negative to positive charge ratio is about 1:3. while 
preferable, it is not necessary that the ratio be 1-3 
Thus, preferably the charge ratio for the compositions "is 
between about 1:1 and 1:10, more preferably between about 
1:2 and 1:5. 

The term "cationic lipid" refers to a lipid which has a 
net positive charge at physiological P H, and preferably 
carries no negative charges at such P H. An example of such 
a lipid is DOTMA. Similarly, "neutral co-lipid" refers to a 
lipid which has is usually uncharged at physiological pH. 
An example of such a lipid is cholesterol. 

Thus, "negative to positive charge ratio" for the DNA 
35 and cationic lipid refers to the ratio between the net 
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negative charges on the DNA compared to the net positive 
charges on the cationic lipid. 

As the form of the DNA affects the expression 
efficiency, the DNA preferably is at least about 80% 
5 supercoiled, more preferably at least about 90% supercoiled, 
and most preferably at least about 95% supercoiled. The 
composition preferably includes an isotonic carbohydrate 
solution, such as an isotonic carbohydrate solution that 
consists essentially of about 10% lactose. In preferred 
10 embodiments, the composition the cationic lipid and the 
neutral co-lipid are prepared as a liposome having an 
extrusion size of about 800 nanometers. * Preferably the 
liposomes are prepared to have an average diameter of 
between about 20 and 800 nm, more preferably between about 
15 50 and 400 nm, still more preferably between about 75 and 
200 nm, and most preferably about 100 nm. Microf luidization 
is the preferred method of preparation of the liposomes. 

In another aspect the invention features a composition 
containing: (a) a first component having a plasmid including 
20 an interferon alpha coding sequence and a cationic lipid 
with a neutral co-lipid, wherein the cationic lipid is DOTMA 
and the neutral co-lipid is cholesterol, wherein the DNA in 
the plasmid and the cationic lipid are present in amounts 
such that the negative to positive charge ratio is about 
25 1:3; and (b) a second component including a protective, 
interactive non-condensing compound, wherein the first 
component is present within the second component. 

In another aspect, the invention provides a composition 
having a protective, interactive non-condensing compound, a 
30 first plasmid including an interferon alpha coding sequence, 
and one or more other plasmids independently having an IL-12 
p35 or IL-12 p40 subunit coding sequence. 

In another aspect, the invention features a method for 
making any of the plasmids described above by inserting a 
35 CMV promoter transcriptionally linked with an interferon 


BNSDOCID <WO 994767BA3 IA> 


WO 99/47678 


PCT/US99/05394 


18 


10 


15 


alpha coding sequence, and a growth hormone 3 ■ -untranslated 
region into a plasmid. 

The invention also provides methods of making the 
compositions described above. The method may involve- (a) 
preparing a DMA molecule having a transcriptional unit 
wherein the transcriptional unit contains an interferon 
alpha coding sequence; (b) preparing a protective, 

interactive non-condensing compound; and (c) combining the 
protective, interactive non-condensing compound with the DNA 
in conditions such that a composition capable of delivering 
a therapeutically effective amount of an interferon alpha 
coding sequence to a mammal is formed. 

Preferably, the DNA molecule is a plasmid, wherein the 
Plasmid includes a CMV promoter transcriptionally linked 
with an interferon alpha coding sequence, and a human growth 
hormone 3 • -untranslated region/poly (A) signal. 

The method may involve the steps of: (a) preparing a 
DNA having an interferon alpha coding sequence; (b) 
preparing a mixture of a cationic lipid and a neutral co- 
lipid, wherein the cationic lipid is DOTMA and the neutral 
co-lipid is cholesterol; and (c) combining the mixture with 
the DNA in amounts such that the cationic lipid and the DNA 
are present in a negative to positive charge ratio of about 

25 in another embodiment, the method involves the steps 

of: (a) preparing a first component having a plasmid 
containing an interferon alpha coding sequence and a 
cationic lipid with a neutral co-lipid, wherein the cationic 
lipid is DOTMA and the neutral co-lipid is cholesterol, 
30 wherein the DNA in the plasmid and the cationic lipid are 
present in amounts such that the negative to positive charge 
ratio is about 1:3; (b) preparing a second component having 
a protective, interactive non-condensing compound; and (c) 
combining the first and second components such that the 
resulting composition includes the first component within 
the second component. 
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In another embodiment, the method involves the steps 
of: (a) preparing a protective, interactive non-condensing 
compound, (b) preparing a first plasmid having an interferon 
alpha coding sequence, (c) preparing one or more other 
5 plasmids independently having an IL-12 p35 or IL-12 p40 
subunit coding sequence, and (d) combining the protective, 
interactive non-condensing compound, the plasmid having the 
interferon alpha coding sequence and the other plasmids. 

In another aspect, the invention provides a method for 
10 treatment of a mammalian condition or disease, by 
administering to a mammal suffering from the condition or 
disease a therapeutically effective amount of a plasmid as 
described herein. 

A "therapeutically effective amount" of a composition 
15 is an amount which is sufficient to cause at least temporary 
relief or improvement in a symptom or indication of a 
disease or condition. Thus, the amount is also sufficient 
to cause a pharmacological effect. The amount of the 
composition need not cause permanent improvement or 
20 improvement of all symptoms or indications. A 
therapeutically effective amount of a cancer therapeutic 
would be one that reduces overall tumor burden in the case 
of metastatic disease (i.e., the number of metasteses or 
their size) or one that reduces the mass of the tumor in 
25 localized cancers. 

The condition or disease preferably is a cancer, such 
as epithelial glandular cancer, including adenoma and 
adenocarcinoma; squamous and transitional cancer, . including 
polyp, papilloma, squamous cell and transitional cell 
carcinoma; connective tissue cancer, including tissue type 
positive, sarcoma and other (oma's); hematopoietic and 
lymphoreticular cancer, including lymphoma, leukemia and 
Hodgkin's disease; neural tissue cancer, including neuroma, 
sarcoma, neurofibroma and blastoma; mixed tissues of origin 
35 cancer, including teratoma and teratocarcinoma. Other 
cancerous conditions that are applicable to treatment 
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include cancer of any of the following: adrenal gland, anus, 
bUe duct, bladder, brain tumors: adult, breast, cancer of 
an unknown primary site, carcinoids of the gastrointestinal 
tract, cervix, childhood cancers, colon and rectum, 
5 esophagus, gall bladder, head and neck, islet cell and other 
pancreatic carcinomas, Kaposi's sarcoma, kidney, leukemia, 
liver, lung: non-small cell, lung: small cell, lymphoma: 
AIDS-associated, lymphoma: Hodgkin's disease. Lymphomas: 
non-Hodgkin's disease, melanoma, mesothelioma, metastatic 
10 cancer, multiple myeloma, ovary, ovarian germ cell tumors 
pancreas, parathyroid, penis, pituitary, prostate, sarcomas 
of bone and soft tissue, skin, small intestine, stomach, 
testis, thymus, thyroid, trophoblastic disease, uterus- 
endometrial carcinoma, uterus: uterine sarcomas, vagina, or 
vulva. The composition preferably is administered by 
miction, although the method may also be performed ex 
va vo. 

In another aspect, the invention provides a method for 
transection (i.e., the delivery and expression of a gene to 
20 cells) of a cell in situ, by contacting the cell with a 
plasmid described herein for sufficient time to transfect 
the cell. Transfection of the cell preferably is performed 
in vivo and the contacting preferably is performed in the 
presence of about 5% PVP solution. 

In another aspect, the invention features a method for 
delivery and expression of an interferon alpha gene in a 
Plurality of cells, by: (a) transfecting the plurality of 
cells with a plasmid or composition of the invention; 


25 
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and 


(b incubating the plurality of cells under conditions 
allowing expression of a nucleic acid sequence in the 
vector, wherein the nucleic acid sequence encodes interferon 
alpha . 

In preferred embodiments, the interferon alpha is human 
interferon alpha and the cells are human cells and/or the 
contacting is performed in the presence of an about 5% PVP 
solution . 
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in another aspect, the invention features a method for 
treating a disease or condition, by transfecting a cell in 
situ with a plasmid or composition of the invention. The 
disease or condition can be a localized disease or condition 
5 or a systemic disease or condition. 

In another aspect, the invention features a cell 
transfected with a plasmid or composition of the invention. 

in yet another aspect, the invention features a method 
for treatment of a mammalian condition or disease, by 
10 administering to a mammal suffering from the condition or 
disease a therapeutically effective amount of a composition 

described herein. 

As the compositions are useful for delivery of a 
nucleic acid molecule to cells in vivo, in a related aspect 
15 the invention provides a composition at an in vivo site of 
administration. In particular this includes at an in vavo 

site in a mammal . 

In preferred embodiments the nucleic acid molecule 
includes a sequence encoding a gene product. Also, in 
20 preferred embodiments, the site of administration is in an 
interstitial space or a tissue of an animal, particularly of 
a mamma 1 . 

The invention also provides methods for using the above 
compositions. Therefore, in further related aspects, 

25 methods of administering the compositions are provided in 
which the composition is introduced into a mammal, 
preferably into a tissue or an interstitial space. 

Various methods of delivery may be utilized, such as 
are known in the art, but in preferred embodiments, the 
30 composition is introduced into the tissue or interstitial 
space by injection. The compositions may also be delivered 
to a variety of different tissues, but in preferred 
embodiments the tissue is muscle or tumor. 

in another related aspect, the invention provides 
35 methods for treating a mammalian condition or disease by 
administering a therapeutically effective amount of a 
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composition as described above. m preferred embodiments, 
the disease or condition is a cancer. 

The summary of the invention described above is „ on - 
luting and other features and advantages of the invention 
will be apparent from the following detailed description of 
the preferred embodiments, as well as from the claims. 

Brief Descr iption Of The Drawings 

Figure 1 shows the effects of interferon alpha in two 
cancer models. 

10 „„ 10 Fi9UrS 2 Sh ° WS a Pl"">id map and sequence (SEQ ID 
NO: 18) for an exemplary IL-12 piasmid - of tne 

invention. 

Figure 3 shows optimal codon usage for highly expressed 
nuinan genes . 

15 . _ _ iQ FigUre 4 sh °™ * Piasmid map and sequence (SEQ ID 
NO: 19) for piasmid pIF0836, an exemplary interferon alpha 
piasmid of the present invention. 

Figure 5 shows a piasmid map and sequence (SEQ ID 
NO: 20, for P IN096, an exemplary I L -l 2 plasmid that can fae 
used with the present invention. 

Figure 6 shows the nucleic acid sequence (SEQ ID NO-21) 
of piasmid PIF0921, an exemplary interferon alpha piasmid of 
tne present invention. 

rn .JtT^ 5 ?A 78 Sh ° W 3 PlaSmid map and ^«3uence (SEQ 

NO:22) for piasmid pIF0921. 

Figure 8 shows an outline of a strategy that can be 
used to synthesize a pIF0921 piasmid. 

Figure 9 shows interferon alpha and IL-12 gene medicine 
(combination therapy) in Renca model. 
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Detailed Description O f The Prefe rred Emb n Hi.pn^ 

The present invention relates to gene delivery and gene 
therapy, and provides novel nucleic acid constructs for 
expression of interferon alpha in a mammal, formulations for 
delivery that incorporate a nucleic acid construct for 
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expression, and methods for preparing and using such 
constructs and formulations. In particular, this invention 
relates to plasmid constructs for delivery of therapeutic 
interferon alpha encoding nucleic acids to cells in order to 
5 modulate tumor activity, methods of using those constructs 
(including combination therapy with other agents, such as 
cytokines, preferably IL-12) , as well as methods for 
preparing such constructs. 

I . General 

10 As described, this invention concerns expression 

systems for the delivery and expression of interferon alpha 
coding sequences in mammalian cells, and formulations and 
methods for delivering such expression systems or other 
expression systems to a mammal. 
15 Therefore, particular genetic constructs are described 

which includes nucleotide sequences coding for interferon 
alpha, preferably human interferon alpha. Such a construct 
can beneficially be formulated and administered as described 
herein, utilizing the expression systems of this invention. 
20 To allow convenient production of such plasmids, it is 

generally preferable that the plasmid. be capable of 
replication in a cell to high copy number. Generally such 
production is carried out in prokaryotic cells, particularly 
including Esherichia coll {E.coli) cells. Thus, the plasmid 
25 preferably contains a replication origin functional in a 
prokaryotic cell, and preferably the replication origin is 
one which will direct replication to a high copy number. 

It is also possible to utilize . synthetic genetic 
elements in the plasmid constructs. 
30 As described below, these elements affect post- 

transcriptional processing in eukaryotic systems. Thus, the 
use of synthetic sequences allows the design of processing 
characteristics as desired for the particular application. 
Commonly, the elements will be designed to provide rapid and 
35 accurate processing. 
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For delivery of DNA encoding a desired expression 
product to a mammalian system, it is usually preferable tQ 
utilize a delivery system. Such a system can provide 
multiple benefits, notably providing stabilization to 
5 protect the integrity of the DNA, as well as assisting in 
cellular uptake. 

In addition, the non-DNA components of the formulation 
may contribute to an immune system enhancement or 
activation. As a result, components of a delivery system 
10 can be selected in conjunction with a particular g ene 
product to enhance or minimize the immuno-stimulatory 
effect. J 

The plasmids may also include elements for expression 
of IL-12 or one or more subunits thereof. Similarly, the 
treatment may involve administration of an interferon alpha 
coding sequence and one or more IL-12 coding sequences 
whether on a single plasmid or on separate plasmids. Such 
plasmids may be incorporated into compositions for delivery 
with a protective, interactive non-condensing compound, a 
cationic lipid and neutral co-lipid, or both. 

While these are specific effective examples, other 
components may be utilized in formulations containing the 
interferon alpha expression vectors of the present invention 
to provide effective delivery and expression of interferon 
alpha or with other coding sequences for which manipulation 
of the activation of immune system components is desirable 

The selection of delivery system components and 
preparation methods in conjunction with the selection of 
coding sequences provides the ability to balance the 
specific effects of the encoded gene products and the immune 
system effects of the overall delivery system composition. 
This capacity to control the biological effects of delivery 
system formulation administration represents an aspect of 
the invention in addition to the interferon alpha encoding 
constructs and specific formuiations of delivery systems 
Co-selection of the encoded gene product and the delivery 
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system components and parameters provides enhanced desired 
effects rather than merely providing high level gene 
expression. In particular, such enhancement is described 
below for the antitumor effects of the exemplary PVP 
5 containing compositions. 

For systems in which IL-12 is also administered, the 
antitumor effect can be greater than merely additive (i.e., 
greater than merely the sum of the antitumor effects of 
interferon alpha alone and IL-12 alone) . Enhancement of 

10 immuno-stimulatory effects is also desirable in other 
contexts, for example, for vaccine applications. 

In contrast, for certain applications", it is preferable 
to select a delivery systems which minimizes the immune 
system effects. For example, it is often preferred that the 

15 immune system activation be minimized for compositions to be 
delivered to the lung in order to minimize lung tissue 
swelling. 

A useful approach for selecting the delivery system 
components and preparation techniques for a particular 
20 coding sequence can proceed as follows, but is not limited 
to these steps or step order. 

1. Select a particular genetic construct which 
provides appropriate expression in vitro. 

2. Select delivery system components based on desired 
25 immunostimulatory effects and/or on in vivo 

physiological effect. Such effects can be tested 
or verified in in vivo model systems. 

3. Select the other delivery system parameters 
consistent with the desired immuno-stimulatory 

30 effects and/or consistent with enhancing the 

desired in vivo physiological effect. Such 
parameters can, for example, include the amount 
and ratio of DNA to one or more other composition 
components, the relative amounts of non-DNA 

35 composition components, the size of delivery 

system formulation particles, the percent 
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supercoiled DNA for circular dsDNA vectors, and 
the specific method of preparation of delivery 
system formulation particles. The particular 
parameters relevant for specific types of 
5 formulations will be apparent or readily 

determined by testing. 
The description below illustrates the selection of 
components and parameters in the context of interferon alpha 
encoding constructs. However, it should be recognized that 
10 the approach is applicable to constructs containing a 
variety of other coding sequences. 

11 * Plasmid Construct Expression Systems 
A * Plasmid Design and Construction 

For the methods and constructs of this invention, a 
15 number of different plasmids were constructed which are 
useful for delivery and expression of sequences encoding 
interferon alpha. Thus, these plasmids contain coding 
regions for interferon alpha, along with genetic elements 
necessary or useful for expression of those coding regions. 

While these embodiments utilized interferon alpha cDNA 
clones or partial genomic sequences from a particular 
source, those skilled in the art could readily obtain 
interferon alpha coding sequences from other sources, or 
obtain a coding sequence by identifying a cDNA clone in a 
library using a probe (s) based on the published interferon 
alpha coding and/or flanking sequences. This also applies 
to the IL-12 coding sequences utilized in the embodiments 
described herein. 

Coding sequences for interferon alpha were incorporated 
30 into an expression plasmid that contains eukaryotic and 
bacterial genetic elements. Eukaryotic genetic elements 
include the CMV immediate early promoter and 5' UTR, and a 
human growth hormone 3' UTR/poly(a) signal, which influence 
gene expression by controlling the accuracy and efficiency 
35 of RNA processing, mRNA stability, and translation. 
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The human growth hormone 3 1 UTR is from a human growth 
hormone gene, and preferably includes a poly (a) signal. 
This sequence can be linked immediately following the 
natural translation termination codon for a cDNA sequence, 
genomic sequence, modified genomic sequence, or synthetic 
sequence coding for interferon alpha. 

An example of a human growth hormone 3' UTR/poly(a) 
signal is shown by the human growth hormone 3' UTR 

(nucleotides 1 - 100) and 3' flanking sequence (nucleotides 
101 - 191; GenBank accession #J03071, HUMGHCSA) below. 

(SEQ ID NO:13) 

1 GGGTGGCATCCCTGTGACCCCTCCCCAGTGCCTCTCCTGGCCCTGGAAGT 

Poly (a) signal 

5 1 TGCCACTCCAGTGCCCACCAGCCTTGTCCTAATAAAATTAAGTXGCATCA 

101 TTTTGTCTGACTAGGTGTCCTTCTATAATATT ATGGGGTGGAGGGGGGTG , 

151 GTATGGAGCAAGGGGCAAGTTGGGAAGACAACCTGTAGGGC 

The 5' and 3' UTR and flanking regions can be further 
and more precisely defined by routine methodology, e.g., 
deletion or mutation analysis or their equivalents., and can 
be modified to' provide other sequences having appropriate 
transcriptional and translational functions. 

1. Construction of plasmid: Plasmid Backbone, human 
interferon alpha cDNA, Final Construct 

A diagrammatic representation of the PGR products and 
plasmids involved in creation of an exemplary construct is 
shown below in Figure 8 . 

Plasmid pIF0921 was constructed from commercially 
available plasmids, and contains the TN5 gene encoding the 
kanamycin resistance gene, the pUC origin of replication, 
the CMV enhancer and promoter to base +112, a synthetic 
intron called IVS8, the human IFN-a2b gene, and the human 
growth hormone 3' UTR. The plasmid construction descendancy 
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for pIL0697 is shown in Figure 8. pIL0697 was cut with 
BamHI and Xba I and the hIFN-a2b PGR product, which had been 
amplified from human genomic DNA with BamHI and Xba I ends, 
was cloned into the pIL0697 backbone in place of the IL-2 
5 coding region. The resulting plasmid was pIF0863. pIF0863 
was cut with Nco I and intron IVS8 from pCT0828 was cloned 
in. The resulting plasmid was pIF0890. pIF0890 was cut 
with Nde I and Pac I and an additional region of the CMV 5' 
UTR to base +112 was cloned in from plasmid pLC0888. 

10 B - Synthetic Genetic Elements 

In some embodiments, some or all" of the genetic 
elements can be synthetic, derived from synthetic 
oligonucleotides, and thus are not obtained directly from 
natural genetic sequences. These synthetic elements are 
15 appropriate for use in many different expression vectors. 

A synthetic intron is designed with splice sites that 
ensure that RNA splicing is accurate and efficient A 
synthetic 3' UTR/poly(A) signal is designed to facilitate 
mRNA 3* end formation and mRNA stability. A synthetic 5' 
20 UTR is designed to facilitate the initiation of translation. 
The design of exemplary synthetic elements is described in 
more detail below. 

1 ' Summary of Synthetic Element Features 
Exemplary synthetic 5 'UTR, intron, and 3 1 UTR/poly (A) 
25 signal have the general features shown below: 

5' UTR Short. 

Lack of secondary structure. 

Kozak sequence. 

Site for intron insertion. 
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Intron 5' splice site sequence matches 

consensus. 

5' splice site sequence is exactly 
complementary to 5 1 end of Ul 
snRNA. 

Branch point sequence matches 
consensus . 

Branch point sequence is 
complementary to U2 snRNA. 
3* splice site matches consensus. 
Polypyrimidine tract is 16 bases in 
length and contains 7 consecutive 
T's. (The tract preferably 
contains at least 5 consecutive 
T' s. ) 

Contains internal restriction 
enzyme sites. 

Bbsl cleaves at the 5*ss, Earl 
cleaves at the 3'ss. 

3' UTR/Poly(A) Based on rabbit p-globin 3' 

UTR/poly(A) signal. 

Consists of two poly (A) signals in 
tandem. 

2. Features of the Synthetic 5'UTR (UT6) : 
The 5' untranslated region (5* UTR) influences the 
translational efficiency of messenger RNA, and is therefore 
5 an important determinant of eukaryotic gene expression. The 
synthetic 5 ' UTR sequence (UT6) has been designed to 
maximize the translational efficiency of mRNAs encoded by 
vectors that express genes of therapeutic interest. 

The sequence of the synthetic 5' UTR (UT6) is shown 
10 below. The Kozak sequence is in boldface and the initiation 
codon is double underlined. The location of the intron 
(between residues 48 and 4 9) is indicated by the filled 
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triangle and the sequences that form the exonic portion of 
consensus splice sites are single underlined. The 
restriction sites for Hindlll and Ncol are overlined. (SEQ 
ID NO: 14) 

5 

HindI11 V Ncol 

AAGCTTACTCAACACAATAACAAACTTACTTACAATCTTAATTAA CAGG CCACCATGG 

The 5' untranslated region (5' UTR) , located between 
10 the cap site and initiation codon, is known to influence the 
efficiency of mRNA translation. Any features that influence 
the accessibility of the 5' cap structure to initiation 
factors, the binding and subsequent migration of the 43S 
preinitiation complex, or the recognition of the initiation 
15 codon, will influence mRNA translatability . An efficient 5 1 
UTR is expected to be one that is moderate in length, devoid 
of secondary structure, devoid of upstream initiation 
codons, and has an AUG within an optimal local context 
(Kozak, 1994, Biochimie 76:815-821; Jansen et al . , 1994). a 
20 5* UTR with these characteristics should allow efficient 
recognition of the 5' cap structure, followed by rapid and 
unimpeded ribosome scanning by the ribosome, thereby 
facilitating the translation initiation process. 

The sequence of the synthetic 5 ' UTR was designed to be 
moderate in length (54 nucleotides (nts) ) , to be deficient 
in G but rich in C and A residues, to lack an upstream ATG, 
to place the intended ATG within the context of a optimal 
Kozak sequence (CCACCATGG) , and to lack potential secondary 
structure. The synthetic 5' UTR sequence was also designed 
to lack AU-rich sequences that target mRNAs to be rapidly 
degraded in the cytoplasm. 

Experiments have demonstrated that introns increase 
gene expression from cDNA vectors, and that introns located 
in the 5' UTR are more effective than ones located in the 3' 
35 UTR (Huang and Gorman, 1990, Mol . Cell. Biol. 10:1805-1810; 
Evans and Scarpulla, 1989, Gene 84:135-142; Brinster et al., 
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1988, Proc. Natl. Acad, Sci . USA 85:836-840; Palmiter et 
al., 1991, Proc. Natl. Acad. Scl. USA 88:478-482; Choi et 
al., 1991, Mol. Cell. Biol. 11:3070-307 4). Accordingly, the 
synthetic 5" UTR sequence was designed to accommodate an 
5 intron with consensus splice site sequences. The intron 
may, for example, be located between residues. 48 and 49 (See 
intron sequence structure below) . The CAG at position 46-48 
is the exonic portion of a consensus 5' splice site. The G 
at position 49 is the exonic portion of a consensus 3' 
10 splice site. 

To facilitate cloning manipulations, the synthetic 5* 
UTR sequence was designed to begin with a-- Hindlll site and 
terminate with a Ncol site. 

3 . Features of the Synthetic Intron 
15 RNA splicing is required for the expression of most 

eukaryotic genes. For optimal gene expression, RNA splicing 
must be highly efficient and accurate. A synthetic intron, 
termed OPTIVS8B, was designed to be maximally efficient and 
accurate. 

20 The structure of the exemplary synthetic intron, 

OPTIVS8 is shown below. Sequences for the 5' splice site 
(5'ss), branch point (bp), and 3' splice site (3'ss) are 
double underlined. The recognition sequences for the 
restriction enzymes Bbsl and Earl are overlined. The 

25 cleavage site for Bbsl corresponds to the 5'ss, and the 
cleavage site for Earl corresponds to the 3'ss. 

5'ss bp 3'ss 

| Bbsl I Earl I 

30 5' CAG GTAAGT GTCTTC (77) TACTAAC GG TTCTTTT TTTCTCTTCACAG £ 3' 

(SEQ ID NO. 15) (SEQ ID NO. 16) 

The 5* splice site (5'ss) sequence matches the 
established consensus sequence/ MAG □ GTRAGT, where M = C or 
A, and R = G or A. Since the mechanism of splicing involves 
35 an interaction between the 5'ss of the pre-mRNA and Ul 
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snRNA, the 5'ss sequence of OPTIVS8B was chosen to be 
exactly complementary to the 5 1 end of Ul snRNA. 

5'ss 5* CAGGUAAGU 3* 

I I I I I I I I I 
Ul RNA 3' GUCCAUUCA 5* 

In mammals, the consensus sequence for branch points 
(YNYTRAY, where Y = C or T, R = A or G, N = any base, and 
the underlined A residue is the actual branch point) is very 
ambiguous. Since the mechanism of splicing involves an 
interaction between the branch point (bp; of the pre-mRNA 
and U2 snRNA, the branch point sequence of OPTIVS8B was 
chosen to maximize this interaction. (Note that the branch 
15 point itself is bulged out) . The chosen sequence also 
matches the branch point sequence that is known to be 
obligatory for pre-mRNA splicing in yeast. The branch point 
is typically located 18-38 nts upstream of the 3' splice 
site. In OPTIVS8B, the branch point is located 24 nts 
upstream from the 3' splice site. 

BP 5' UACUA^C 3' 

I I I I I I 
U2 RNA 3' AUGAU G 5' 


20 


25 


30 


The sequence of the 3' splice site (3'ss) matches the 
established consensus sequence, YuNYAG i G, where Y = C or 
T, and N = any base. In 3* splice sites, the polypyrimidine 
tract (Y u ) is the major determinant of splice site strength. 
For optimal splice site function in OPTIVS8B, the length of 
the polypyrimidine tract was extended to 16 bases, and its 
sequence was adjusted to contain 7 consecutive T residues. 
This feature was included because Roscigno et al. (1993) 
demonstrated that optimal splicing requires the presence of 
35 at least 5 consecutive T residues in the polypyrimidine 
tract . 
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Splicing in vitro is generally optimal when introns are 
>80 nts in length (Wieringa, et al., 1984; Ulfendahl et al., 
1985, Nucl. Acids Res. 13:6299-6315). Although many introns 
may be thousands of bases in length, most naturally 
occurring introns are 90-200 nt in length (Hawkins, 1988, 
Nucl. Acids Res. 16:9893-9908). The length of the 

synthetic intron (118 nts) falls within this latter range. 

OPTIVS8B was designed with three internal restriction 
enzyme sites, Bbsl, Nhel, and Earl. These restriction sites 
facilitate the screening and identification of genes that 
contain the synthetic intron sequence. In addition, the 
Bbsl and Earl sites were placed so that their cleavage sites 
exactly correspond to the 5'ss (Bbsl) or 3'ss (Earl). The 
sequence of the polypyrimidine tract was specificall 
designed to accommodate the Earl restriction site 
Inclusion of the Bbsl and Earl sites at these locations i 
useful because they permit the intron to be precisely 
deleted from a gene. They also permit the generation of an 
-intron cassette" that can be inserted at other locations 
within a gene. 

The 77 bases between the Bbsl site and the branch point 
sequence are random in sequence, except for the inclusion of 
the Nhel restriction site. 

4. Features of the Synthetic 3' UTR/poly(A) 
Signal : 

The 3' ends of eukaryotic mRNAs are formed by the 
process of polyadenylation . This process involves site 
specific site RNA cleavage, followed by addition of a 
poly (A) tail. RNAs that lack a poly (A) tail are highly 
unstable. Thus, the efficiency of cleavage/polyadenylation 
is a major determinant of mRNA levels, and thereby, of gene 
expression levels. 2XPA1 is a synthetic sequence, containing 
two efficient poly (A) signals, that is designed to be 
maximally effective in polyadenylation. 
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A poly (A) signal is required for the formation of the 
3» end of most eukaryotic mRNA. The signal directs two RNA 
processing reactions: site-specific endonucleolytic cleavage 
of the RNA transcript, and stepwise addition of adenylates 
(approximately 250) to the newly generated 3' end to form 
the poly (A) tail. A poly (A) signal has three parts: 
hexanucleotide, cleavage site, and downstream element. The 
hexanucleotide is typically AAUAAA and cleavage sites are 
most frequently 3' to the dinucleotide CA (Sheets et al., 
1987). Downstream elements are required for optimal poly (A) 
signal function and are located downstream of the cleavage 
site. The sequence requirement for downstream elements is 
not yet fully established, but is generally viewed as UG- or 
U-rich sequences (Wickens, 1990; Proudfoot, 1991, Cell 
64:671-674; Wahle, 1992, Bioessays 14:113-118; Chen and 
Nordstrom, 1992, Nucl . Acids Res. 20:2565-2572). 

Naturally occurring poly (A) signals are highly variable 
in their effectiveness (Peterson, 1992). The effectiveness 
of a particular poly (A) signal is mostly determined by the 
quality of the downstream element. (Wahle, 1992). in 
expression vectors designed to express genes of therapeutic 
interest, it is important to have a poly (A) signal that is 
as efficient as possible. 

Poly (A) efficiency is important for gene expression, 
because transcripts that fail to be cleaved and 
polyadenylated are rapidly degraded in the nuclear 
compartment. In fact, the efficiency of polyadenylation in 
living cells is difficult to measure, since 
nonpolyadenylated RNAs are so unstable. In addition to 
being required for mRNA stability, poly (A) tails contribute 
to the translatability of mRNA, and may influence other RNA 
processing reactions such as splicing or RNA transport 
({Jackson and Standart , 1990, Cell 62:15-24; Wahle, 1992). 

Some eukaryotic genes have more than one poly (A) site, 
implying that if the cleavage/polyadenylation reaction fails 
to occur at the first site, it will occur at one of the 
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later sites. In COS cell transfection experiments, a gene 
with two strong poly (A) sites yielded approximately two-fold 
more mRNA than one with a single strong poly (A) site 
(Bordonaro, 1995) . These data suggest that a significant 
fraction of transcripts remain unprocessed even with a 
single "efficient" poly (A) signal. Thus, it may be 

preferable to include more than one poly (A) site. 

The sequence of the exemplary synthetic poly (A) signal 
is shown below. The sequence is named 2XPA. The 
hexanucleotide sequences and downstream element sequences 
are double underlined, and the two poly (A) sites are labeled 
as pA#l and pA#2. Convenient restriction sites are 

overlined. The entire 2XPA unit may be transferred in 
cloning experiments as a Xbal-Kpnl fragment. Deletion of 
the internal BspHI fragment results in the formation of a 
1XPA unit. (SEQ ID NO. 17) 

Xbal - Bs P HI 

TCTAGAGCATTTTTCCCTCTGCCAAAAATTATGGGGACATCATGAAGCCCCTTGAGCATCTGACG 

pA#l 

Hex | Downstream element 

TCTGGCT AATAAA nnAAATTTATTTTCATTGCAAT AGTGTGTTGGAA TTTTTTGTGTCTCTCACT 

BspHI 

CGGT AC TAG AG C ATT T T T C C C TC T GC C AAAAAT T AT GGG G AC AT C ATG AAG C C C C T TG AGC AT C T 

pA#2 . 

Hex | Downstream element 

cnrrT r T rr~Tn3Tn^ 


Kpnl 


CACTCGGTACC 


The sequence of the synthetic poly (A) site shown above 
is based on the sequence of the rabbit D-globin poly (A) 
signal, a signal that has been characterized in the 
literature as strong (Gil and Proudfoot, 1987, Cell 49:399- 
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406; Gil and Proudfoot, 1984, Nature 312:473-474). One of 
its key features is the structure of its downstream element, 
which contains both UG- and U-rich domains. 

A double-stranded DNA sequence corresponding to the 
1XPA sequence was constructed from synthetic 

oligonucleotides. Two copies of the 1XPA sequence were then 
joined to form the 2XPA sequence. The sequences were joined 
in such as way as to have a unique Xbal site at the 5 1 end 
of the first poly (A) signal containing fragment, and a 
unique Kpnl site at the 3' end of the second poly (A) signal 
containing fragment. 

C * Interfer on Alpha and IL-12 Coding Sequences 
The nucleotide sequence of a natural human interferon 
alpha coding sequences is known, and is provided below, 
along with a synthetic sequence which also codes for human 
interferon alpha. The same applies with respect to the 
IL-12 coding sequences. 

In some cases, instead of the natural sequence coding 
for interferon alpha, it is advantageous to utilize 
synthetic sequences which encode interferon alpha. Such 
synthetic sequences have alternate codon usage from the 
natural sequence, and thus have dramatically different 
nucleotide sequences from the . natural sequence. In 
particular, synthetic sequences can be used which have codon 
usage at least partially optimized for expression in a 
human. The natural sequences do not have such optimal codon 
usage. Preferably, substantially all the codons are 

optimized. 

Optimal codon usage in humans is indicated by codon 
usage frequencies for highly expressed human genes, as shown 
in Fig. 3. The codon usage chart is from the program 
"Human_High.cod" from the Wisconsin Sequence Analysis 
Package, Version 8.1, Genetics Computer Group, Madison, WI. 
The codons which are most frequently used in highly 
expressed human genes are presumptively the optimal codons 


WO 99/47678 


PCT/US99/05394 


37 

for expression in human host cells, and thus form the basis 
for constructing a synthetic coding sequence. An example of 
a synthetic interferon alpha coding sequence is shown as the 
bottom sequence in the table below. 
5 However, rather than a sequence having fully optimized 

codon usage, it may be desirable to utilize an interferon 
alpha encoding sequence which has optimized codon usage 
except in areas where the same amino acid is too close 
together or abundant to make uniform codon usage optimal. 
10 In addition, other synthetic sequences can be used 

which have substantial portions of Che codon usage 
optimized, for example, with at least 50%, 70%, 80% or 90% 
optimized codons as compared to a natural coding sequence. 
Other particular synthetic sequences for interferon alpha 
15 can be selected by reference to the codon usage chart in 
Fig. 3. A sequence is selected by choosing a codon for each 
of the amino acids of the polypeptide sequences. - DNA 
molecules corresponding to each of the polypeptides can then 
by constructed by routine chemical synthesis methods. , For 
20 example, shorter oligonucleotides can be synthesized, and 
then ligated in the appropriate relationships to construct 
the full-length coding sequences. 

The following sequences are provided in the sequence 
listing herein: interferon alpha amino acid sequence, SEQ 
25 ID NO: 9; interferon alpha wild type nucleic acid sequence, 
SEQ ID NO: 10; interferon alpha synthetic nucleic acid 
sequence with optimized codon usage, SEQ ID NO: 11; 
interferon alpha nucleic acid sequence with additional/semi- 
optimized codon usage, SEQ ID NO: 12; IL-12 p40 subunit amino 
30 acid sequence, SEQ ID NO:l; IL-12 p40 wild type nucleic acid 
sequence, SEQ ID NO: 2; IL-12 p40 synthetic nucleic acid 
sequence with all codons optimized, SEQ ID NO: 3; IL-12 p4 0 
subunit nucleic acid sequence with all codons optimized 
except when same nucleic acids were too close/abundant, SEQ 
35 ID NO: 4; IL-12 p35 amino acid sequence, SEQ ID NO: 5; IL-12 
p35 wild type nucleic acid sequence, SEQ ID NO: 6; IL-12 p35 
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synthetic nucleic acid sequence with all codons optimized, 
SEQ ID NO: 7; IL-12 p35 subunit nucleic acid sequence with 
all codons optimized except when same nucleic acids were too 
close/abundant, SEQ ID NO: 8. Those skilled in the art will 
realize that various nucleic acid sequences with optimized 
codon usage can be constructed, for example based on the 
various combinations shown below, wherein optimal usage for 
each codon is shown below the IL-12 p35 and P 40 subunit wild 
type sequences and the interferon alpha wild type sequence. 

Sequences Encoding Human IL-12 p35 

First line = natural sequence (SEQ ID NO. 6) 

Second line = all codons optimized (SEQ ID NO. 7) 

Third line = all codons optimized except when same 

nucleic acids were too close/abundant (changes between 

second and third lines bolded) (SEQ ID NO. 8) 

ATG TGT CCA GCG CGC AGC CTC CTC CTT GTG GCT ACC CTG GTC CTC CTG GAC CAC CTC ACT 
ATG TGC CCC GCC CGC AGC CTG CTG CTG GTG GCC ACC CTG GTG CTG CTG GAC CAC CTG AGC 
ATG TGC CCC GCC CGC AGC CTG CTG CTC GTG GCC ACC CTG GTG CTC CTG GAC CAC CTC AGC 

TTG GCC AGA AAC CTC CCC GTG GCC ACT CCA GAC CCA GGA ATG TTC CCA TGC CTT CAC CAC 
CTG GCC CGC AAC CTG CCC GTG GCC ACC CCC GAC CCC. GGC ATG TTC CCC TGC CTG CAC CAC 
CTG GCC CGC AAC CTC CCC GTG GCC ACC CCA GAC CCC GGC ATG TTC CCA TGC CTG CAC CAC 

TCC CAA AAC CTG CTG AGG GCC GTC AGC AAC ATG CTC CAG AAG GCC AGA CAA ACT CTA GAA 
AGC CAG AAC CTG CTG GCG GCC GTG AGC AAC ATG CTG CAG AAG GCC GCG CAG ACC CTG GAG 
AGC CAG AAC CTG CTG GCG GCC GTG AGC AAC ATG CTG CAG AAG GCC GCG CAG ACC CTG GAG 

TTT TAC CCT TGC ACT TCT GAA GAG ATT GAT CAT GAA GAT ATC ACA AAA GAT AAA ACC AGC 
TTC TAC CCC TGC ACC AGC GAG GAG ATC GAC CAC GAG GAC ATC ACC AAG GAC AAG ACC AGC 
TTC TAC CCC TGC ACC AGC GAG GAG ATC GAC CAC GAG GAC ATC ACC AAG GAC AAG ACC AGC 

ACA GTG GAG GCC TGT TTA CCA TTG GAA TTA ACC AAG AAT GAG AGT TGC CTA AAT TCC AGA 
ACC GTG GAG GCC TGC CTG CCC CTG GAG CTG ACC AAG AAC GAG AGC TGC CTG AAC AGC CGC 
ACC GTG GAG GCC TGC CTG CCC CTC GAG TTA ACC AAG AAC GAG AGC TGC CTC AAC AGC CGC 


WO 99/47678 


PCTAJS99/05394 


39 


GAG ACC TCT TTC ATA ACT AAT GGG ACT TGC CTG GCC TCC AGA AAG ACC TCT TTT ATG ATG 
GAG ACC AGC TTC ATC ACC AAC GGC AGC TGC CTG GCC AGC CGC AAG ACC AGC TTC ATG ATG 
GAG ACC TCC TTC ATC ACC AAC GGC ACT TGC CTG GCC TCC CGC AAG ACC AGC TTC ATG ATG 

GCC CTG TGC CTT AGT ACT ATT TAT GAA GAC TTG AAG ATG TAG CAG GTG GAG TTC AAG ACC 
GCC CTG TGC CTG AGC AGC ATC TAC GAG GAC CTG AAG ATG TAG CAG GTG GAG TTC AAG ACC 
GCC CTG TGC CTG AGC TCC ATC TAC GAG GAC CTG AAG ATG TAC CAG GTG GAG TTC AAG ACC 

ATG AAT GCA AAG CTT CTG ATG GAT CCT AAG AGG CAG ATC TTT CTA GAT CAA AAC ATG CTG 
ATG AAC GCC AAG CTG CTG ATG GAC CCC AAG CTC CAG ATC TTC CTG -GAC CAG AAC ATG CTG 
ATG AAC GCC AAG CTC CTG ATG GAC CCC AAG CTC CAG ATC TTC CTG GAC CAG AAC ATG CTG 

GCA GTT ATT GAT GAG CTG ATG CAG GCC CTG AAT TTC AAC AGT GAG ACT GTG CCA CAA AAA 
GCC GTG ATC GAC GAG CTG ATG CAG GCC CTG AAC TTC AAC AGC GAG ACC GTG CCC CAG AAG 
GCC GTG ATC GAC GAG CTG ATG CAG GCC CTG AAC TTC AAC AGC GAG ACC GTG CCC CAG AAG 

TCC TCC CTT GAA GAA CCG GAT TTT TAT AAA ACT AAA ATC AAG CTC TGC ATA CTT CTT CAT 
AGC AGC CTG GAG GAG CCC GAC TTC TAC AAG ACC AAG ATC AAG CTG TGC ATC CTG CTG CAC 
AGC AGC CTG GAG GAG CCC GAC TTC TAC AAG ACC AAG ATC AAG CTG TGC ATC CTG CTG CAC 

GCT TTC AGA ATT CGG GCA GTG ACT ATT GAC AGA GTG ACG AGC TAT CTG AAT GCT TCC TAA 
GCC TTC CGC ATC CGC GCC GTG ACC ATC GAC CGC GTG ACC AGC TAC CTG AAC GCC ACC TGA 
GCC TTC CGC ATC CGG GCC GTG ACC ATC GAC CGC GTG ACC AGC TAC CTG AAC GCC ACG TGA 

Additional Optimized Sequences Coding For IL-12 P 35 Subunit 
(Second Line = SEP ID NO: 24) 


10 


20 


Met Cys Pro Ala Arg Ser Leu Leu Leu Val Ala Thr Leu Val Leu Leu Asp His Leu Ser 
ATG TGY CCN GCN MGN WSN YTN YTN YTN GTN GCN ACN YTN GTN YTN YTN GAY CAY YTN WSN 

ATG TGT CCT GCT CGT TCT TTA TTA TTA GTT GCT ACT TTA GTT TTA TTA GAT CAT TTA TCT 
TGC CCC GCC CGC TCC TTG TTG TTG GTC GCC ACC TTG GTC TTG TTG GAC CAC TTG TCC 
CCA GCA CGA TCA CTT CTT CTT GTA GCA ACA CTT GTA CTT CTT CTT TCA 

CCG GCG CGG TCG CTC CTC CTC GTG GCG ACG CTC GTG CTC CTC CTC TCG 
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AGA ACT CTA CTA CTA C TA CTA CTA CTA ACT 

AGG AGC CTG CTG CTG CTG CTG CTG CTG AGC 


Leu Ala Arg Asn Leu Pro Val Ala Thr Pro Asp Pro Gly Met Phe Pro C ys Leu His His 
YTN GCN MGN AAY YTN CCN GTN GCN ACN CCN GAY CCN GGN ATG TTY CCN TGY YTN CAY CAY 

TTA GCT CGT AAT TTA CCT GTT GCT ACT CCT GAT CCT GGT ATG TTT CCT TGT TTA CAT CAT 
TTG GCC CGC AAC TTG CCC GTC GCC ACC CCC GAC CCC GGC TTC CCC TGC TTG CAC CAC 

CTT GCA CGA CTT CCA GTA GCA ACA CCA CCA GGA C CA CTT 

CTC GCG CGG CTC CCG GTG GCG ACG CCG CCG GGG - C CG CTC 

CTA AGA CTA 

CTA 

CTG AGG CTG 

CTG 

Ser cin Asn Leu Leu Arg Ala Val Ser Asn Met Leu Gin Lys Ala Arc, Gin Thr Leu Glu 
WSN CAR AAY YTN YTN MGN GCN GTN WSN AAY ATG YTN CAR AAR GCN MGN CAR ACN YTN GAR 

TCT CAA AAT TTA TTA CGT GCT GTT TCT AAT ATG TTA CAA AAA GCT CGT CAA ACT TTA GAA 
TCC CAG AAC TTG TTG CGC GCC GTC TCC AAC TTG CAG AAG GCC CGC CAG ACC TTG GAG 

TCA CTT CTT CGA GCA GTA TCA CTT GCA CGA ACA CTT 


TCG CTC CTC CGG GCG GTG TCG CTC GCG CGG ACG CTC 

CTA CTA AGA 

AGC CTG CTG AGG flG C CTG AGG CTG 


Phe Tyr Pro Cys Thr Ser Glu Glu He Asp His Glu Asp He Thr Lys Asp Lys Thr Ser 
TTY TAY CCN TGY ACN WSN GAR GAR ATH GAY CAY GAR GAY ATH ACN AAR GAY AAR ACN WSN 

TTT TAT CCT TGT ACT TCT GAA GAA ATT GAT CAT GAA GAT ATT ACT AAA GAT AAA^ ACT TCT 

CCA ACA TCA ATA ATA ACA ACA TCA 

CCG ACG TCG 

ACG ACG TCG 

AGT 

AGT 

AGC 

AGC 
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90 10 ° 

Thr Val Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys Asn Glu Ser Cys Leu Asn Ser Arg 

ACN GTN GAR GCN TGY YTN CCN YTN GAR YTN ACN AAR AAY GAR WSN TGY YTN AAY WSN MGN 

ACT GTT GAA GCT TGT TTA CCT TTA GAA TTA ACT AAA AAT GAA TCT TGT TTA AAT TCT CGT 
ACC GTC GAG GCC TGC TTG CCC TTG GAG TTG ACC AAG AAC GAG TCC TGC TTG AAC TCC CGC 
ACA GTA GCA CTT CCA CTT CTT ACA TCA CTT TCA CGA 

ACG GTG GCG CTC CCG CTC CTC ACG TCG CTC TCG CGG 

CTA CTA CTA AGT CTA AGT AGA 

CTG CTG CTG AGC CTG AGC AGG 

110 120 
Glu Thr Ser Phe lie Thr Asn Gly Ser Cys Leu Ala Ser Arg Lys Thr Ser Phe Met Met 
GAR ACN WSN TTY ATH ACN AAY GGN WSN TGY YTN GCN WSN MGN AAR ACN WSN TTY ATG ATG 

GAA ACT TCT TTT ATT ACT AAT GGT TCT TGT TTA GCT TCT CGT AAA ACT TCT TTT ATG ATG 
GAG ACC TCC TTC ATC ACC AAC GGC TCC TGC TTG GCC TCC CGC AAG ACC TCC TTC 
ACA TCA ATA ACA GGA TCA CTT GCA TCA CGA ACA TCA 

ACG TCG ACG GGG TCG CTC GCG TCG CGG ACG TCG 

AGT AGT CTA AGT AGA AGT 

AGC AGC CTG AGC AGG AGC 

130 "0 
Ala Leu Cys Leu Ser Ser lie Tyr Glu Asp Leu Lys Met Tyr Gin Val Glu Phe Lys Thr 
GCN YTN TGY YTN WSN WSN ATH TAY GAR GAY. YTN AAR ATG TAY CAR GTN GAR TTY AAR ACN 

GCT TTA TGT TTA TCT TCT ATT TAT GAA GAT TTA AAA ATG TAT CAA GTT GAA TTT AAA ACT 

GCC TTG TGC TTG TCC TCC ATC TAC GAG GAC TTG AAG TAG CAG GTC: GAG TTC AAG ACC 

GCA CTT CTT TCA TCA ATA CTT GTA ACA 

GCG CTC CTC TCG TCG CTC GTG ACG 

CTA CTA AGT AGT CTA 

CTG CTG AGC AGC CTG 


WO 99/47678 


PCT/US99/0S394 


42 


160 

Met Asn Ala Lys Leu Leu Met Asp Pro Lys Arg Gin He Phe Leu Asp Gin Asn Met Leu 
ATG AAY GCN AAR YTN YTN ATG GAY CCN AAR MGN CAR ATH TTY YTN GAY CAR AAY ATG YTN 

ATG AAT GCT AAA TTA TTA ATG GAT CCT AAA CGT CAA ATT TTT TTA GAT CAA AAT ATG TTA 
AAC GCC AAG TTG TTG GAC CCC AAG CGC CAG ATC TTC TTG GAC CAG AAC TTG 

GCA CTT CTT CCA CGA ATA CTT 


GCG CTC CTC CCG CGG CTC 

CTA CTA AGA C TA 

CTG CTG AGG CTG 


CTT 
CTC 
CTA 
CTG 


Ala val lie Asp Glu Leu Met Gin Ala Leu Asn Phe Asn Ser Glu Thr Val Pro Gin Lys 
GCN GTN ATH GAY GAR YTN ATG CAR GCN YTN AAY TTY AAY WSN GAR ACN GTN CCN CAR AAR 

GCT GTT ATT GAT GAA TTA ATG CAA GCT TTA AAT TTT AAT TCT GAA ACT GTT CCT CAA AAA 
GCC GTC ATC GAC GAG TTG CAG GCC TTG AAC TTC AAC TCC GAG ACC GTC CCC CAG AAG 

GCA GTA ATA CT T GCA CTT TCA ACAGTACCA 

GCG GTG CTC GCG CTC TCG ACG GTG CCG 

CTA CTA AGT 

CT G CTG AGC 


200 

Ser Ser Leu Glu Glu Pro Asp Phe Tyr Lys Thr Lys He Lys Leu Cys He Leu Leu His 
WSN WSN YTN GAR GAR CCN GAY TTY TAY AAR ACN AAR ATH AAR YTN TGY ATH YTN YTN CAY 

TCT TCT TTA GAA GAA CCT GAT TTT TAT AAA ACT AAA ATT AAA TTA TGT ATT TTA TTA CAT 
TCC TCC TTG GAG GAG CCC GAC TTC TAC AAG ACC AAG ATC AAG TTG TGC ATC TTG TTG CAC 
TCA TCA CTT CCA ACA ATA CTT ATA CTT CTT 

TCG TCG CTC CCG ACG CTC CTC CTC 

AGT AGT CTA _ 

CTA CTA CTA 

AGC AGC CTG 

CTG CTG CTG 


WO 99/47678 


PCT/US99/05394 


210 

Ala Phe Arg lie Arg Ala Val Thr lie Asp 
GCN TTY MGN ATH MGN GCN GTN ACN ATH GAY 


43 

220 

Arg Val Thr Ser Tyr Leu Asn Ala Ser *** 
MGN GTN ACN WSN TAY YTN AAY GCN WSN TRR 


GCT TTT CGT ATT CGT GCT GTT ACT ATT 
GCC TTC CGC ATC CGC GCC GTC ACC ATC 
GCA CGA ATA CGA GCA GTA ACA ATA 

GCG CGG CGG GCG GTG ACG 

AGA AGA 

AGG AGG 


CGT 

GTT 

ACT 

TCT 

TAT 

TTA 

AAT 

GCT 

TCT 

CGC 

GTC 

ACC 

TCC 

TAC 

TTG 

AAC 

GCC 

TCC 

CGA 

GTA 

ACA 

TCA 


CTT 


GCA 

TCA 

CGG 

GTG 

ACG 

TCG 


CTC 


GCG 

TCG 

AGA 



AGT 


CTA 



AGT 

AGG 



AGC 


CTG 



AGC 


Sequences Encoding Human IL-12 p4Q 

First line = natural sequence (SEQ ID NO. 2) 
Second line = all codons optimized (SEQ ID NO. 3) 
Third line - all codons optimized except when same 
nucleic acids were too close/abundant (changes between 
second and third lines bolded) (SEQ ID NO. 4) " 

ATG TGT CAC CAG CAG TTG GTC ATC TCT TGG TTT TCC CTG GTT TTT CTG GCA TCT CCC CTC 
ATG TGC CAC CAG CAG CTG GTG ATC AGC TGG TTC AGC CTG GTG TTC CTG GCC AGC CCC CTG 
ATG TGC CAC CAG CAG CTG GTG ATC AGC TGG TTC TCC CTG GTG TTT CTG GCC AGC CCC CTC 

GTG GCC ATA TGG GAA CTG AAG AAA GAT GTT TAT GTC GTA GAA TTG GAT TGG TAT CCG GAT 
GTG GCC ATC TGG GAG CTG AAG AAG GAC GTG TAC GTG . GTG GAG CTG GAC TGG TAC CCC GAC 
GTG GCC ATC TGG GAG CTG AAG AAA GAC GTG TAC GTG GTC GAG CTG GAC TGG TAC CCC GAC 

GCC CCT GGA GAA ATG GTG GTC CTC ACC TGT GAC ACC CCT GAA GAA GAT GGT ATC ACC TGG 
GCC CCC GGC GAG ATG GTG GTG CTG ACC TGC GAC ACC CCC GAG GAG GAC GGC ATC ACC TGG 
GCC CCC GGC GAG ATG GTG GTC CTG ACC TGC GAC ACC CCC GAG GAA GAC GGC ATC ACC TGG 

ACC TTG GAC CAG AGC AGT GAG GTC TTA GGC TCT GGC AAA ACC CTG ACC ATC CAA GTC AAA 
ACC CTG GAC CAG AGC AGC GAG GTG CTG GGC AGC GGC AAG ACC CTG ACC ATC CAG GTG AAG 
ACC CTG GAC CAG AGC AGT GAG GTG CTG GGC TCC GGC AAG ACC CTG ACC ATC CAG GTG AAG 

GAG TTT GGA GAT GCT GGC CAG TAC ACC TGT CAC AAA GGA GGC GAG GTT CTA AGC CAT TCG 
GAG TTC GGC GAC GCC GGC CAG TAC ACC TGC CAC AAG GGC GGC GAG GTG CTG AGC CAC AGC 
GAG TTC GGC GAC GCC GGC CAG TAC ACC TGC CAC AAG GGA GGC GAG GTG CTG AGC CAC TCC 


WO 99/47678 


PCT/US99/0S394 


44 


CTC CTG CTG CTT CAC AAA AAG GM GAT GGA ATT TGG TCC ACT GAT ATT TTA AAG GAC CAG 
CTG CTG CTG CTG CAC AAG AAG GAG GAC GGC ATC TGG AGC ACC GAC ATC CTG AAG GAC CAG 
CTC CTG CTG CTC CAC AAA AAG GAG GAC GGC ATC TGG AGC ACC GAC ATC CTG AAG GAC CAG 

AAA GAA CCC AAA AAT AAG ACC TTT CTA AGA TGC GAG GCC AAG AAT TAT TCT GGA CGT TTC 
AAG GAG CCC AAG AAC AAG ACC TTC CTG CGC TGC GAG GCC AAG AAC TAC AGC GGC CGC TTC 
AAG GAG CCC AAG AAC AAG ACC TTC CTG CGC TGC GAG GCC AAG AAC TAC AGC GGC CGC TTC 

ACC TGC TGG TGG CTG ACG ACA ATC ACT ACT GAT TTG ACA TTC AGT GTC AAA AGC AGC AGA 
ACC TGC TGG TGG CTG ACC ACC ATC AGC ACC GAC CTG ACC TTC AGG GTG AAG AGC AGC AGG 
ACC TGC TGG TGG CTG ACC ACG ATC AGC ACC GAC CTG ACC TTC AGT GTG AAG AGC AGC AGG 

GGC TCT TCT GAC CCC CAA GGG GTG ACG TGC GGA GCT GCT ACA CTC TCT GCA GAG AGA GTC 
GGC AGC AGC GAC CCC CAG GGC GTG ACC TGC GGC GCC GCC ACC CTG AGC GCC GAG CGC GTG 
GGC TCC AGC GAC CCC CAG GGC GTG ACC TGC GGC GCT GCC ACC CTG AGC GCC GAG CGC GTG 

AGA GGG GAC AAC AAG GAG TAT GAG TAC TCA GTG GAG TGC CAG GAG GAC AGT GCC TGC CCA 
CGC GGC GAC AAC AAG GAG TAC GAG TAC AGC GTG GAG TGC CAG GAG GAC AGC GCC TGC CCC 
CGC GGC GAC AAC AAG GAG TAC GAG TAC AGC GTG GAG TGC CAG GAA GAC TCC GCC TGC CCC 

GCT GCT GAG GAG AGT CTG CCC ATT GAG GTC ATG GTG GAT GCC GTT CAC AAG CTC AAG TAT 
GCC GCC GAG GAG AGC CTG CCC ATC GAG GTG ATG GTG GAC GCC GTC CAC AAG CTG AAG TAC 
GCC GCT GAG GAG AGC CTG CCC ATC GAG GTG ATG GTG GAC GCC GTT CAC AAG CTG AAG TAC 

GAA AAC TAC ACC AGC AGC TTC TTC ATC AGG GAC ATC ATC AAA CCT GAC CCA CCC AAG AAC 
GAG AAC TAC ACC AGC AGC TTC TTC ATC CGC GAC ATC ATC AAG CCC GAC CCC CCC AAG AAC 
GAG AAC TAC ACC AGC AGC TTC TTC ATC CGC GAC ATC ATC AAG CCT GAC CCA CCC AAG AAC 

TTG CAG CTG AAG CCA TTA AAG AAT TCT CGG CAG GTG GAG GTC AGC TGG GAG TAC CCT GAC 
CTG CAG CTG AAG CCC CTG AAG AAC AGC CGC CAG GTG GAG GTG AGC TGG GAG TAC CCC GAC 
CTC CAG CTG AAG CCC CTC AAG AAC TCC CGC CAG GTG GAG GTG AGC TGG GAG TAC CCC GAC 
ACC TGG AGT ACT CCA CAT TCC TAC TTC TCC CTG ACA TTC TGC GTT CAG GTC CAG GGC AAG 
ACC TGG AGC ACC CCC CAC AGC TAC TTC AGC CTG ACC TTC TGC GTG CAG GTG CAG GGC AAG 
ACC TGG AGC ACG CCC CAC TCC TAC TTC TCC CTG ACC TTC TGC GTG CAG GTC CAG GGC AAG 


WO 99/47678 


PCT/US99/05394 


AGC 

AAG 

AGA 

GAA 

AGC 

AAG 

CGC 

GAG 

AGC 

AAG 

CGC 

GAG 

CGC 

AAA 

AAT 

GCC 

CGC 

AAG 

AAC 

GCC 

CGC 

AAG 

AAC 

GCC 


AAG AAA GAT AGA 
AAG AAG GAC CGC 
AAG AAA GAC CGG 

AGC ATT AGC GTG 
AGC ATC AGC GTG 
AGC ATC AGC GTG 
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GTC TTC ACG GAC 
GTG TTC ACC GAC 
GTG TTC ACC GAC 

CGG GCC CAG GAC 
CGC GCC CAG GAC 
CGC GCC CAG GAC 


AAG ACC TCA GCC 
AAG ACC AGC GCC 
AAG ACC AGC GCC 

CGC TAC TAT AGC 
CGC TAC TAC AGC 
CGC TAC TAT AGC 


ACG GTC ATC TGC 
ACC GTG ATC TGC 
ACC GTC ATC TGC 

TCA TCT TGG AGC 
AGC AGC TGG AGC 
TCC TCT TGG AGC 


GAA 

TGG 

GCA 

TCT 

GTG 

CCC 

TGC 

AGT 

TAG 

GAG 

TGG 

GCC 

AGC 

GTG 

CCC 

TGC 

AGC 

TAG 

GAG 

TGG 

GCC 

AGC 

GTG 

CCC 

TGC 

TCC 

TAG 


Additional Optimized Sequences Coding For IL-12 p40 Subunit 
(Second Line SEQ ID NO:25) 

10 20 
Met Cys His Gin Gin Leu Val He Ser Trp Phe Ser Leu Val Phe Leu Ala Ser Pro Leu 
ATG TGY CAY CAR CAR YTN GTN ATH WSN TGG TTY WSN YTN GTN TTY YTN GCN WSN CCN YTN 


TGT 

CAT 

CAA 

CAA 

TTA 

GTT 

ATT 

TCT TGG 

TTT 

TCT 

TTA 

GTT 

TTT 

TTA 

GCT 

TCT 

CCT 

TTA 

TGC 

CAC 

CAG 

CAG 

TTG 

GTC 

ATC 

TCC 

TTC 

TCC 

TTG 

GTC 

TTC 

TTG 

GCC 

TCC 

CCC 

TTG 





CTT 

GTA 

ATA 

TCA 


TCA 

CTT 

GTA 


CTT 

GCA 

TCA 

CCA 

CTT 





CTC 

GTG 


TCG 


TCG 

CTC 

GTG 


CTC 

GCG 

TCG 

CCG 

CTC 





CTA 



AGT 


AGT 

CTA 



CTA 


AGT 


CTA 





CTG 



AGC 


AGC 

CTG 



CTG 


AGC 


CTG 


30 40 

Val Ala He Trp Glu Leu Lys Lys Asp Val Tyr Val Val Glu Leu Asp Trp Tyr Pro Asp 

GTN GCN ATH TGG GAR YTN AAR AAR GAY GTN TAY GTN GTN GAR YTN GAY TGG TAY CCN GAY 

GTT GCT ATT TGG GAA TTA AAA AAA GAT GTT TAT GTT GTT GAA TTA GAT TGG TAT CCT GAT 

GTC GCC ATC GAG TTG AAG AAG GAC GTC TAC GTC GTC GAG TTG GAC TAC CCC GAC 

GTA GCA ATA CTT GTA GTA GTA CTT CCA 

GTG GCG CTC GTG GTG GTG CTC CCG 

CTA CTA 

CTG CTG 


WO 99/47678 


PCT7US99/05394 


46 


50 60 
Ala Pro Gly Glu Met Val Val Leu Thr Cys Asp Thr Pro Glu Glu Asp Gly lie Thr Trp 
GCN CCN GGN GAR ATG GTN GTN YTN ACN TGY GAY ACN CCN GAR GAR GAY GGN ATH ACN TGG 


GCT CCT GGT GAA ATG GTT GTT 

GCC CCC GGC GAG GTC GTC 

GCA CCA GGA GTA GTA 

GCG CCG GGG GTG GTG 


TTA ACT TGT GAT ACT CCT 
TTG ACC TGC GAC ACC CCC 
CTT ACA ACA CCA 

CTC ACG ACG CCG 

CTA 
CTG 


GAA GAA GAT GGT ATT ACT TGG 
GAG GAG GAC GGC ATC ACC 
GGA ATA ACA 
GGG ACG 


70 80 
Thr Leu Asp Gin Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr He Gin Val Lys 
ACN YTN GAY CAR WSN WSN GAR GTN YTN GGN WSN GGN AAR ACN YTN ACN ATH CAR GTN AAR 


ACT TTA GAT CAA TCT TCT GAA GTT TTA GGT TCT GGT AAA ACT TTA ACT ATT CAA GTT AAA 
ACC TTG GAC CAG TCC TCC GAG GTC TTG GGC TCC GGC AAG ACC TTG ACC ATC CAG GTC AAG 
ACA CTT TCA TCA GTA CTT GGA TCA GGA ACA CTT ACA ATA GTA 

ACG CTC TCG TCG GTG CTC GGG TCG GGG ACG CTC ACG GTG 

CTA AGT AGT CTA AGT C TA 

CTG AGC AGC CTG AGC CTG. 


90 100 
Glu Phe Gly Asp Ala Gly Gin Tyr Thr Cys His Lys Gly Gly Glu Val Leu Ser His Ser 
GAR TTY GGN GAY GCN GGN CAR TAY ACN TGY CAY AAR GGN GGN GAR GTN YTN WSN CAY WSN 


GAA TTT GGT GAT GCT GGT CAA TAT ACT TGT 
GAG TTC GGC GAC GCC GGC CAG TAC ACC TGC 
GGA GCA GGA ACA 

GGG GCG GGG ACG 


CAT AAA GGT GGT GAA GTT TTA TCT CAT TCT 

CAC AAG GGC GGC GAG GTC TTG TCC CAC TCC 

GGA GGA GTA CTT TCA TCA 

GGG GGG GTG CTC TCG TCG 

CTA AGT AGT 

CTG AGC AGC 


WO 99/47678 


PCT/US99/05394 
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110 120 


Leu 

Leu 

Leu 

Leu 

His 

Lys 

Lys 

Glu Asp Gly 

He 

Trp 

Ser 

Thr 

Asp 

He 

Leu 

Lys 

Asp Gin 

YTN 

YTN 

YTN 

YTN 

CAY 

AAR 

AAR 

GAR 

GAY 

GGN 

ATH 

TGG 

WSN 

ACN 

GAY 

ATH 

YTN 

AAR 

GAY 

CAR 

TTA 

TTA 

TTA 

TTA 

CAT 

AAA 

AAA 

GAA 

GAT 

GGT 

ATT 

TGG 

TCT 

ACT 

GAT 

ATT 

TTA 

AAA 

GAT 

GAA 

TTG 

TTG 

TTG 

TTG 

CAC 

AAG 

AAG 

GAG 

GAC 

GGC 

ATC 


TCC 

ACC 

GAC 

ATC 

TTG 

AAG 

GAC 

CAG 

CTT 

CTT 

CTT 

CTT 






GGA 

ATA 


TCA 

ACA 


ATA 

CTT 




CTC 

CTC 

CTC 

CTC 






GGG 



TCG 

ACG 



CTC 




CTA 

CTA 

CTA 

CTA 









AGT 




CTA 




CTG 

CTG 

CTG 

CTG 









AGC 




CTG 













130 










140 

Lys 

Glu 

Pro 

Lys 

Asn 

Lys 

Thr 

Phe 

Leu 

Arg 

Cys 

Glu 

Ala 

Lys 

Asn Tyr 

Ser 

Gly Arg 

Phe 

AAR 

GAR 

CCN 

AAR 

AAY 

AAR 

ACN 

TTY 

YTN 

MGN 

TGY 

GAR 

GCN 

AAR 

AAY 

TAY 

WSN 

GGN 

MGN 

TTY 

AAA 

GAA 

CCT 

AAA 

AAT 

AAA 

ACT 

TTT 

TTA 

CGT 

TGT 

GAA 

GCT 

AAA 

AAT 

TAT 

TCT 

GGT 

CGT 

TTT 

AAG 

GAG 

CCC 

AAG 

AAC 

AAG 

ACC 

TTC 

TTG 

CGC 

TGC 

GAG 

GCC 

AAG 

AAC 

TAC 

TCC 

GGC 

CGC 

TTC 



CCA 




ACA 


CTT 

CGA 



GCA 




TCA 

GGA 

CGA 




CCG 




ACG 


CTC 

CGG 



GCG 




TCG 

GGG 

CGG 










CTA 

AGA 







AGT 


AGA 










CTG 

AGG 







AGC 


AGG 











150 










160 

Thr 

Cys 

Trp 

Trp 

Leu 

Thr 

Thr 

He 

Ser 

Thr 

Asp 

Leu 

Thr 

Phe 

Ser 

Val 

Lys 

Ser 

Ser 

Arg 

ACN 

TGY 

TGG 

TGG 

YTN 

ACN 

ACN 

ATH 

WSN 

ACN 

GAY 

YTN 

ACN 

TTY 

WSN 

GTN 

AAR 

WSN 

WSN 

MGN 

ACT 

TGT 

TGG 

TGG 

TTA 

ACT 

ACT 

ATT 

TCT 

ACT 

GAT 

TTA 

ACT 

TTT 

TCT 

GTT 

AAA 

TCT 

TCT 

CGT 

ACC 

TGC 



TTG 

ACC 

ACC 

ATC 

TCC 

ACC 

GAC 

TTG 

ACC 

TTC 

TCC 

GTC 

AAG 

TCC 

TCC 

CGC 

ACA 




CTT 

ACA 

ACA 

ATA 

TCA 

ACA 


CTT 

ACA 


TCA 

GTA 


TCA 

TCA 

CGA 

ACG 




CTC 

ACG 

ACG 


TCG 

ACG 


CTC 

ACG 


TCG 

GTG 


TCG 

TCG 

CGG 





CTA 




AGT 



CTA 



AGT 



AGT 

AGT 

AGA 





CTG 




AGC 



CTG 



AGC 



AGC 

AGC 

AGG 


WO 99/47678 


PCT/US99/05394 
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170 180 


Gly 

Ser 

Ser 

Asp 

Pro 

Gin 

Gly 

Val 

Thr 

Cys 

Gly 

Ala 

Ala 

Thr 

Leu 

Ser 

Ala 

Glu 

Arg 

Val 

GGN 

WSN 

WSN 

GAY 

CCN 

CAR 

GGN 

GTN 

ACN 

TGY 

GGN 

GCN 

GCN 

ACN 

YTN 

WSN 

GCN 

GAR 

MGN 

GTN 

GGT 

TCT 

TCT 

GAT 

CCT 

CAA 

GGT 

GTT 

ACT 

TGT 

GGT 

GCT 

GCT 

ACT 

TTA 


GCT 



GTT 

GGC 

TCC 

TCC 

GAC 

CCC 

CAG 

GGC 

GTC 

ACC 

TGC 

GGC 

GCC 

GCC 

ACC 

TTG 

TCC 

GCC 

GAG 

CGC 

GTC 

GGA 

TCA 

TCA 


CCA 


GGA 

GTA 

ACA 


GGA 

GCA 

GCA 

ACA 

CTT 

TCA 

GCA 


CGA 

GTA 

GGG 

TCG 

TCG 


CCG 


GGG 

GTG 

ACG 


GGG 

GCG 

GCG 

ACG 





CGG 

GTG 


AGT 

AGT 
















AGA 



AGC 

AGC 
















AGG 











1 90 










200 

Arg 

Gly 

Asp 

Asn 

Lys 

Glu 

Tyr 

Glu 

Tyr 

Ser 

Val 

Glu 

Cys 

Gin 

Glu 

Asp 

Ser 

Ala 

Cys 

Pro 

MGN 

GGN 

GAY 

AAY 

AAR 

GAR 

TAY 

GAR 

TAY 

WSN 

GTN 

GAR 

TGY 

CAR 

GAR 

GAY 

WSN 

GCN 

TGY 

CCN 

CGT 

GGT 

GAT 

AAT 

AAA 

GAA 

TAT 

GAA 

TAT 

TCT 

GTT 

GAA 

TGT 

CAA 

GAA 

GAT 

TCT 

GCT 



CGC 

GGC 

GAC 

AAC 

AAG 

GAG 

TAC 

GAG 

TAC 

TCC 

GTC 

GAG 

TGC 

CAG 

GAG 

GAC 

TCC 

GCC 

TGC 

CCC 

CGA 

GGA 








TCA 

GTA 






TCA 

GCA 


CCA 

CGG 

GGG 








TCG 

GTG 










AGA 









AGT 











AGG 









AGC 




















210 










220 

Ala 

Ala 

Glu 

Glu 

Ser 

Leu 

Pro 

He 

Glu 

Val 

Met 

Val 

Asp 

Ala 

Val 

His 

Lys 

Leu 

Lys 

Tyr 

GCN 

GCN 

GAR 

GAR 

WSN 

YTN 

CCN 

ATH 

GAR 

GTN 

ATG 

GTN 

GAY 

GCN 

GTN 

CAY 

AAR 

YTN 

AAR 

TAY 

GCT 

GCT 

GAA 

GAA 

TCT 

TTA 

CCT 

ATT 

GAA 

GTT 

ATG 

GTT 

GAT 

GCT 

GTT 

CAT 

AAA 

TTA 

AAA 

TAT 

GCC 

GCC 

GAG 

GAG 

TCC 

TTG 

CCC 

ATC 

GAG 

GTC 


GTC 

GAC 

GCC 

GTC 

CAC 

AAG 

TTG 

AAG 

TAC 

GCA 

GCA 



TCA 

CTT 

CCA 

ATA 


GTA 


GTA 


GCA 

GTA 



CTT 



GCG 

GCG 



TCG 

CTC 

CCG 



GTG 


GTG 


GCG 

GTG 



CTC 




AGT CTA CTA 
AGC CTG CTG 


WO 99/47678 


PCT/US99/05394 
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230 240 


Glu 

Asn 

Tyr 

Thr 

Ser 

Ser 

Phe 

Phe 

He 

Arg 

Asp 

He 

He 

Lys 

Pro 

Asp 

Pro 

Pro 

Lys 

Asn 

GAR 

AAY 

TAY 

ACN 

WSN 

WSN 

TTY 

TTY 

ATH 

MGN 

GAY 

ATH 

ATH 

AAR 

CCN 

GAY 

CCN 

CCN 

AAR 

AAY 

GAA 

AAT 

TAT 

ACT 

TCT 

TCT 

TTT 

TTT 

ATT 

CGT 

GAT 

ATT 

ATT 

AAA 

CCT 

GAT 

CCT 

CCT 

AAA 

AAT 

GAG 

AAC 

TAC 

ACC 

TCC 

TCC 

TTC 

TTC 

ATC 

CGC 

GAC 

ATC 

ATC 

AAG 

CCC 

GAC 

CCC 

CCC 

AAG 

AAC 




ACA 

TCA 

TCA 



ATA 

CGA 


ATA 

ATA 


CCA 


CCA 

CCA 






ACG 

TCG 

TCG 




CGG 





CCG 


CCG 

CCG 




AGT AGT AGA 
AGC AGC AGG 

250 260 


Leu 

Gin 

Leu 

Lys 

Pro 

Leu 

Lys 

Asn 

Ser 

Arg 

Gin 

Val 

Glu 

Val 

Ser 

Trp 

Glu 

Tyr 

Pro 

Asp 

YTN 

CAR 

YTN 

AAR 

CCN 

YTN 

AAR 

AAY 

WSN 

MGN 

CAR 

GTN 

GAR 

GTN 

WSN 

TGG 

GAR 

TAY 

CCN 

GAY 

TTA 

CAA 

TTA 

AAA 

CCT 

TTA 

AAA 

AAT 

TCT 

CGT 

CAA 

GTT 

GAA 

GTT 

TCT 

TGG 

GAA 

TAT 

CCT 

GAT 

TTG 

CAG 

TTG 

AAG 

CCC 

TTG 

AAG 

AAC 

TCC 

CGC 

CAG 

GTC 

GAG 

GTC 

TCC 


GAG 

TAC 

CCC 

GAC 

CTT 


CTT 


CCA 

CTT 



TCA 

CGA 


GTA 


GTA 

TCA 




CCA 


CTC 


CTC 


CCG 

CTC 



TCG 

CGG 


GTG 


GTG 

TCG 




CCG 


CTA 


CTA 



CTA 



AGT 

AGA 





AGT 






CTG 


CTG 



CTG 



AGC 

AGG 





AGC 















270 










280 

Thr 

Trp 

Ser 

Thr 

Pro 

His 

Ser 

Tyr 

Phe 

Ser 

Leu 

Thr 

Phe 

Cys 

Val 

Gin 

Val 

Gin Gly 

Lys 

ACN 

TGG 

WSN 

ACN 

CCN 

CAY 

WSN 

TAY 

TTY 

WSN 

YTN 

ACN 

TTY 

TGY 

GTN 

CAR 

GTN 

CAR 

GGN 

AAR 

ACT 

TGG 

TCT 

ACT 

CCT 

CAT 

TCT 

TAT 

TTT 

TCT 

TTA 

ACT 

TTT 

TGT 

GTT 

CAA 

GTT 

CAA 

GGT 

AAA 

ACC 


TCC 

ACC 

CCC 

CAC 

TCC 

TAC 

TTC 

TCC 

TTG 

ACC 

TTC 

TGC 

GTC 

CAG 

GTC 

CAG 

GGC 

AAG 

ACA 


TCA 

ACA 

CCA 


TCA 



TCA 

CTT 

ACA 



GTA 


GTA 


GGA 


ACG 


TCG 

ACG 

CCG 


TCG 



TCG 

CTC 

ACG 



GTG 


GTG 


GGG 



AGT AGT AGT CTA 

AGC AGC AGC CTG 
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290 300 
Ser Lys Arg Glu Lys Lys Asp Arg Val Phe Thr Asp Lys Thr Ser Ala Thr Val lie Cys 
WSN AAR MGN GAR AAR AAR GAY MGN GTN TTY ACN GAY AAR ACN WSN GCN ACN GTN ATH TGY 

TCT AAA CGT GAA AAA AAA GAT CGT GTT TTT ACT GAT AAA ACT TCT GCT ACT GTT ATT TGT 
TCC AAG CGC GAG AAG AAG GAC CGC GTC TTC ACC GAC AAG ACC TCC GCC ACC GTC ATC TGC 
TCA CGA CGA GTA ACA ACA TCA GCA ACA GTA ATA 

TCG CGG CGG GTG ACG ACG TCG GCG ACG GTG 

AGT AGA AGA AGT 

AGC AGG AGG AGC 


Arg Lys Asn Ala Ser He Ser Val Arg Ala Gin Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
MGN AAR AAY GCN WSN ATH WSN GTN MGN GCN CAR GAY MGN TAY TAY WSN WSN WSN TGG WSN 

CGT AAA AAT GCT TCT ATT TCT GTT CGT GCT CAA GAT CGT TAT TAT TCT TCT TCT TGG TCT 

CGC AAG AAC GCC TCC ATC TCC GTC CGC GCC CAG GAC CGC TAC TAC TCC TCC TCC TCC 

CGA GCA TCA ATA TCA GTA CGA GCA CGA TCA TCA TCA TCA 

CGG GCG TCG TCG GTG CGG GCG CGG TCG TCG TCG TCG 


AGA AGT AGT AGA 

AGG AGC AGC AGG 


AGA AGT AGT AGT AGT 

AGG .AGC AGC AGC AGC 


Glu Trp Ala Ser Val Pro Cys Ser *** 
GAR TGG GCN WSN GTN CCN TGY WSN TRR 


GAA TGG GCT TCT GTT CCT TGT TCT TAA 
GAG GCC TCC GTC CCC TGC TCC TAG 

GCA TCA GTA CCA TCA TGA 

GCG TCG GTG CCG TCG 
AGT AGT 
AGC AGC 
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Wild Type Sequence Coding For Interferon Alpha 

9 18 27 36 45 54 

5' ATG GCC TTG ACC TTT GCT TTA CTG GTG GCC CTC CTG GTG CTC AGC TGC AAG TCA 

MA LT F ALLVAL LV L S C KS 

63 72 81 90 99 108 

AGC TGC TCT GTG GGC TGT GAT CTG CCT CAA ACC CAC AGC CTG GGT AGC AGG AGG 

SCSVGCDLPQTHSLGSRR 

117 126 135 144 153 162 

ACC TTG ATG CTC CTG GCA CAG ATG AGG AGA ATC TCT CTT TTC TCC TGC TTG AAG 

TLMLLAQMRRI SLFSCLK 

171 180 189 198 207 216 

GAC AGA CAT GAC TTT GGA TTT CCC CAG GAG GAG TTT GGC AAC CAG TTC CAA AAG 

DRHDFGFPQEE FGNQFQK 

225 234 243 252 261 270 

GCT GAA ACC ATC CCT GTC CTC CAT GAG ATG ATC CAG CAG ATC TTC AAT CTC TTC 

A E T I PVLHEMI QQI FN LF 

279 288 297 306 315 324 

AGC ACA AAG GAC TCA TCT GCT GCT TGG GAT GAG ACC CTC CTA GAC AAA TTC TAC 

STKDS SAAWDETLLDKFY 

333 342 351 360 369 378 

ACT GAA CTC TAC CAG CAG CTG AAT GAC CTG GAA GCC TGT GTG ATA CAG GGG GTG 

TELYQQLNDLEACVIQGV 
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387 396 «>S 4!4 423 <32 

GGG GTG ACA GAG ACT CCC CTG ATG AAG GAG GAC TCC ATT CTG GCT GTG AGG AAA 

GVT ETP LMKEDS1LAVrk 

441 450 "9 468 477 486 

TAC TTC CAA AGA ATC ACT CTC TAT CTG AAA GAG AAG AAA TAC AGC CCT TGT GCC 

YF0RIT LYLKEKKy S PCA 

495 504 513 522 631 540 

TGG GAG GTT GTC AGA GCA GAA ATC ATG AGA TCT TTT TCT TTG TCA ACA AAC TTG 

WEVVRftE I"RSFSLSTN L 

549 558 567 

CAA GAA AGT TTA AGA AGT AAG GAA TGA 3' 

QESLRSKE 

Interferon Al ph a Coding Sequence with All Codons Optimized 
(SEQ ID NO:ll) " " 

ATG GCC CTG ACC TTC GCC CTG CTG GTG GCC CTG CTG GTG CTG AGC TGC AAG AGC AGC TGC 
TCC GTG GGG TGC GAC CTG CCC CAG ACC CAC AGC CTG GGG AGC CGG CGG ACC CTG ATG CTG 
CTG GCC CAG ATG CGG CGG ATC AGC CTG TTC AGC TGC CTG AAG GAC CGG CAC GAC TTC GGG 
TTC CCC CAG GAG GAG TTC GGG AAC CAG TTC CAG AAG GCC GAG ACC ATC CCC GTG CTG CAC 
GAG ATG ATC CAG CAG ATC TTC AAC CTG TTC AGC ACC AAG GAC AGC AGC GCC GCC TGG GAC 
GAG ACC CTG CTG GAC AAG TTC TAC ACC GAG CTG TAC CAG CAG CTG AAC GAC CTG GAG GCC 
TGC GTG ATC CAG GGG GTG GGG GTG ACC GAG ACC CCC CTG ATG AAG GAG GAC AGC ATC CTG 
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GCC GTG CGG AAG TAC TTC CAG CGG ATC ACC CTG TAC CTG AAG GAG AAG AAG TAC TCC CCC 
TGC GCC TGG GAG GTG GTG CGG GCC GAG ATC ATG CGG AGC TTC AGC CTG AGC ACC AAC CTG 
CAG GAG AGC CTG CGG AGC AAG GAG TGA 

Additional /Semi Optimized Sequence Coding For Interferon 
Alpha (Second Line = SEQ ID NO: 12) 

10 20 
MET ALA LEU THR PHE ALA LEU LEU VAL ALA LEU LEU VAL LEU SER CYS LYS SER SER CYS 
ATG GCN YTN ACN TTY GCN YTN YTN GTN GCN YTN YTN GTN YTN WSN TGY AAR WSN WSN TGY 

ATG GCT TTA ACT TTT GCT TTA TTA GTT GCT TTA TTA GTT TTA TCT TGT AAA TCT TCT TGT 
GCC TTG ACC TTC GCC TTG TTG GTC GCC TTG TTG GTC TTG TCC TGC AAG TCC TCC TGC 
GCA CTT ACA GCA CTT CTT GTA GCA CTT CTT GTA CTT TCA TCA TCA 

GCG CTC ACG GCG CTC CTC GTG GCG CTC CTC GTG CTC TCG TCG TCG 

CTA CTA CTA CTA CTA CTA AGT AGT AGT 

CTG CTG CTG CTG CTG CTG AGC AGC AGC 

30 40 
SER VAL GLY CYS ASP LEU PRO GLN THR HIS SER LEU GLY SER ARG ARG THR LEU MET LEU 
WSN GTN GGN TGY GAY YTN CCN CAR ACN CAY WSN YTN GGN WSN MGN MGN ACN YTN ATG YTN 

TCT GTT GGT TGT GAT TTA CCT CAA ACT CAT TCT TTA GGT TCT CGT CGT ACT TTA ATG TTA 

TCC GTC GGC TGC GAC TTG CCC CAG ACC CAC TCC TTG GGC TCC CGC CGC ACC TTG TTG 

TCA GTA GGA CTT CCA ACA TCA CTT GGA TCA CGA CGA ACA CTT CTT 

TCG GTG GGG CTC CCG ACG TCG CTC GGG TCG CGG CGG ACG CTC CTC 

AGT CTA AGT CTA AGT AGA AGA CTA CTA 

AGC CTG AGC CTG AGC AGG AGG CTG CTG 
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50 60 
LEU ALA GLN MET ARG ARG ILE SER LEU PHE SER CYS LEU LYS ASP ARC HIS ASP PHE GLY 
YTN GCN CAR ATG MGN MGN ATH WSN YTN TTY WSN TGY YTN AAR GAY MGN CAY GAY TTY GGN 

TTA GCT CAA ATG CGT CGT ATT TCT TTA TTT TCT TGT TTA AAA GAT CGT CAT GAT TTT GGT 

TTG GCC CAG CGC CGC ATC TCC TTG TTC TCC TGC TTG AAG GAC CGC CAC GAC TTC GGC 

CTT GCA CGA CGA ATA TCA CTT TCA CTT CGA GGA 

CTC GCG CGG CGG TCG CTC TCG CTC CGG GGG 

CTA AGA AGA AGT CTA AGT CTA AGA 

CTG AGG AGG AGC CTG AGC CTG AGG 

70 80 
PHE PRO GLN GLU GLU PHE GLY ASN GLN PHE GLN LYS ALA GLU THR ILE PRO VAL LEU HIS 
TTY CCN CAR GAR GAR TTY GGN AAY CAR TTY CAR AAR GCN GAR ACN ATH CCN GTN YTN CAY 

TTT CCT CAA GAA GAA TTT GGT AAT CAA TTT CAA AAA GCT GAA ACT ATT CCT GTT TTA CAT 
TTC CCC CAG GAG GAG TTC GGC AAC CAG TTC CAG AAG GCC GAG ACC ATC CCC GTC TTG CAC 


CCA GGA GCA ACA ATA CCA GTA CTT 

CCG GGG GCG ACG CCG GTG CTC 


CTA 
CTG 

90 100 
GLU MET ILE GLN GLN ILE PHE ASN LEU PHE SER THR LYS ASP SER SER ALA ALA TRP ASP 
GAR ATG ATH CAR CAR ATH TTY AAY YTN TTY WSN ACN AAR GAY WSN WSN GCN GCN TGG GAY 

GAA ATG ATT CAA CAA ATT TTT AAT TTA TTT TCT ACT AAA GAT TCT TCT GCT GCT TGG GAT 
GAG ATC CAG CAG ATC TTC AAC TTG TTC TCC ACC AAG GAC TCC TCC GCC GCC GAC 

ATA ATA CTT TCA ACA TCA TCA GCA GCA 

CTC TCG ACG TCG TCG GCG GCG 

CTA AGT AGT AGT 

CTG AGC AGC AGC 
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no 

GLU THR LEU LEU ASP LYS PHE TYR THR GLU 
GAR ACN YTN YTN GAY AAR TTY TAY ACN GAR 


120 

LEU TYR GLN GLN LEU ASN ASP LEU GLU ALA 
YTN TAY CAR CAR YTN AAY GAY YTN GAR GCN 


GAA ACT TTA TTA GAT AAA TTT TAT ACT GAA TTA TAT CAA CAA TTA AAT GAT TTA GAA GCT 

GAG ACC TTG TTG GAC AAG TTC TAC ACC GAG TTG TAC CAG CAG TTG AAC GAC TTG GAG GCC 

ACA CTT CTT ACA CTT CTT CTT GCA 

ACG CTC CTC ACG CTC CTC CTC GCG 

CTA CTA CTA CTA CTA 

CTG CTG CTG CTG CTG 

130 140 
CYS VAL ILE GLN GLY VAL GLY VAL THR GLU THR PRO LEU MET LYS GLU ASP SER ILE LEU 
TGY GTN ATH CAR GGN GTN GGN GTN ACN GAR ACN CCN YTN ATG AAR GAR GAY WSN ATH YTN 


TGT GTT ATT CAA GGT GTT GGT GTT ACT GAA ACT CCT TTA ATG AAA GAA GAT TCT ATT TTA 

TGC GTC ATC CAG GGC GTC GGC GTC ACC GAG ACC CCC TTG AAG GAG GAC TCC ATC TTG 

GTA ATA GGA GTA GGA GTA ACA ACA CCA CTT TCA ATA CTT 

GTG GGG GTG GGG GTG ACG ACG CCG CTC TCG CTC 

CTA AGT CTA 

CTG AGC CTG 


150 

ALA VAL ARG LYS TYR PHE GLN ARG ILE THR 
GCN GTN MGN AAR TAY TTY CAR MGN ATH ACN 


160 

LEU TYR LEU LYS GLU LYS LYS TYR SER PRO 
YTN TAY YTN AAR GAR AAR AAR TAY WSN CCN 


GCT GTT CGT AAA TAT TTT CAA CGT ATT ACT TTA TAT TTA AAA GAA AAA AAA TAT TCT CCT 

GCC GTC CGC AAG TAC TTC CAG CGC ATC ACC TTG TAC TTG AAG GAG AAG AAG TAC TCC CCC 

GCA GTA CGA CGA ATA ACA CTT CTT TCA CCA 

GCG GTG CGG CGG ACG CTC CTC TCG CCG 

AGA AGA CTA CTA AGT 

AGG AGG CTG CTG AGC 
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170 180 
CYS ALA TRP GLU VAL VAL ARG ALA GLU ILE MET ARG SER PHE SER LEU SER THR ASN LEU 
TGY GCN TGG GAR GTN GTN MGN GCN GAR ATH ATG MGN WSN TTY WSN YTN WSN ACN AAY YTN 

TGT GCT TGG GAA GTT GTT CGT GCT GAA ATT ATG CGT TCT TTT TCT TTA TCT ACT AAT TTA 

TGC GCC GAG GTC GTC CGC GCC GAG ATC CGC TCC TTC TCC TTG TCC ACC AAC TTG 

GCA GTA GTA CGA GCA ATA CGA TCA TCA CTT TCA ACA CTT 

GCG GTG GTG CGG GCG CGG TCG TCG CTC TCG ACG CTC 

AGA AGA AGT AGT CTA AGT CTA 

AGG AGG AGC AGC CTG AGC CTG 


GLN GLU SER LEU ARG SER LYS GLU *** 
CAR GAR WSN YTN MGN WSN AAR GAR TRR 


CAA GAA TCT TTA CGT TCT AAA GAA TAA 
CAG GAG TCC TTG CGC TCC AAG GAG TAG 

TCA CTT CGA TCA TGA 

TCG CTC CGG TCG 

AGT CTA AGA AGT 

AGC CTG AGG AGC 


Delivery and expression of nucleic acids in many 
formulations is limited due to degradation of the nucleic 
acids by components of organisms, such as nucleases. Thus, 
protection of the nucleic acids when delivered in vivo can 
greatly enhance the resulting expression, thereby enhancing 
a desired pharmacological or therapeutic effect. It was 
found that certain types of compounds which interact with a 
nucleic acid (e.g., DNA) in solution but do not condense the 
nucleic acid provide in vivo protection to the nucleic acid, 
and correspondingly enhance the expression of an encoded 
gene product. 

We have described the use of delivery systems designed 
to interact with plasmids and protect plasmids from rapid 
extracellular nuclease degradation [Mumper, R.J., e t al., 
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1996, Pharm. Res. 13:701-709; Mumper, R.J., et al., 1997. 
Submitted to Gene Therapy] . A characteristic of the PINC 
systems is that they are non-condensing, systems that allow 
the plasmid to maintain flexibility and diffuse freely 
throughout the muscle while being protected from nuclease 
degradation. While the PINC systems are primarily discussed 
below, it will be understood that cationic lipid based 
systems and systems utilizing both PINCS and cationic lipids 
are also within the scope of the present invention. 

A common structural component of the PINC systems is 
that they are amphiphilic molecules, -having both a 
hydrophilic and a hydrophobic portion. The hydrophilic 
portion of the PINC is meant to interact with plasmids by 
hydrogen bonding (via hydrogen bond acceptor or donor 
groups) , Van der Waals interactions, or/and by ionic 
interactions. For example, PVP and N-methyl-2-pyrrolidone 
(NM2P) are hydrogen bond acceptors while PVA and PG are 
hydrogen bond donors . 

All four molecules have been reported to form complexes 
with various (poly ) anionic molecules [Buhler V., BASF 
Aktiengescellschaf t Feinchemie, Ludwigshaf en, pp 39-42; 
Galaev Y, et al., J. Chrom. A. 684:45-54 (1994); Tarantino 
R, et al. J. Pharm. Sci . 83:1213-1216 (1994); Zia, H., et 
al., Pharm. Res. 8:502-504 (1991);]. The hydrophobic 

portion of the PINC systems is designed to result in a 
coating on the plasmid rendering its surface more 
hydrophobic. Kabanov et al. have described previously the 
use of cationic polyvinyl derivatives for plasmid 
condensation designed to increase plasmid hydrophobicity , 
protect plasmid from nuclease degradation, and increase its 
affinity for biological membranes [Kabanov, A.V., and 
Kabanov, V.A., 1995, Bioconj . Chem. 6:7-20; Kabanov, A.V., 
et al., 1991, Biopolymers 31:1437-1443; Yaroslavov, A. A., et 
al., 1996, FEBS Letters 384:177-180]. 

Substantial protective effect is observed; up to at 
least a one log enhancement of gene expression in rat muscle 
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over plasmid formulated in saline has been demonstrated with 
these exemplary PINC systems. We have also found that the 
expression of reporter genes in muscle using plasmids 
complexed with the PINC systems was more reproducible than 
when the plasmid was formulated in saline. For example, the 
coefficient of variation for reporter gene expression in 
muscle using plasmid formulated in saline was- 96 + 35% ( n = 
20 studies; 8-12 muscles/study) whereas with coefficient of 
variation with plasmids complexed with PINC systems was 40 + 
19% (n = 30 studies; 8-12 muscles/study) . The high 

coefficient of variation for reporter gene expression with 
plasmid formulated in saline has been described previously 
[Davis, H.L., et al . , 1993, Hum. Gene Ther . 4:151-9]. in 
addition, in contrast with the results for DNA: saline, there 
was no significant difference in gene expression in muscle 
when plasmid with different topologies were complexed with 
polyvinyl pyrrolidone (PVP) . This suggests that PVP is able 
to protect all forms of the plasmid from rapid nuclease 
degradation . 


1 • Summary of interactions between a PINC 
polymer (PVP) and plasmid 
We have demonstrated using molecular modeling that an 
exemplary PINC polymer, PVP, forms hydrogen bonds with the 
base pairs of a plasmid within its major groove and results 
in a hydrophobic surface on the plasmid due to the vinyl 
backbone of PVP. These interactions are supported by the 
modulation of plasmid zeta potential by PVP as well as by 
the inhibition of ethidium bromide intercalation into 
complexed plasmid. We have correlated apparent binding 

between PVP and plasmid to pH and salt concentration and 
have demonstrated the effect of these parameters on D-gal 
expression after intramuscular injection of plasmid/PVP 
complexes [Mumper, R.J., et al., 1997. Submitted to Gene 
Therapy] . a summary of the physico-chemical properties of 
plasmid/PVP complexes is listed in Table I below. 
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Table I: Summary of the Physico-Chemical Properties of 
Plasmid/PVP Complexes 


Method 

Molecular modeling 
Fourier-trans formed 
Infra-red Hydrogen 
DNase I challenge 


Microt it ration 
Calorimetry 

Potentiometric titration 


Dynamic Dialysis 


Zeta potential 
modulation 
Ethidium bromide 
Intercalation 

Osmotic pressure 

Luminescence 
Spectroscopy 


Result 

Hydrogen bonding and 
hydrophobic plasmid surface 
observed bonding demonstrated 
Decreased rate of plasmid 
degradation in the presence 
of PVP 

Positive heats of reaction 
indicative of an endothermic 
process 

One unit pH drop when plasmid 
and PVP are complexed 
Rate of diffusion of PVP 
reduced in the presence of 
plasmid 

Surface charge of plasmid 
decreased by PVP 
Ethidium bromide 
intercalation reduced by 
plasmid/PVP complexation 
Hyper-osmotic formulation 
(i.e., 340 mOsm/kg H 2 0) 
Plasmid/PVP binding decreased 
in salt and/or at pH 7 


5 2 . Histology of expression in muscle 

Immunohistochemistry for p-gal using a slide scanning 
technology has revealed the uniform distribution of p-gal 
expression sites across the whole cross-sections of rat 
tibialis muscles. Very localized areas were stained positive 
10 for p-gal when CMV-p-gal plasmid was formulated in saline, 
p-gal positive cells were observed exclusively around the 
needle tract when plasmid was injected in saline. This is 
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in agreement with previously published results [Wolff, J. A., 
et al., 1990, Science 247:1465-68; Davis, H.L., et al . , 
1993, Hum. Gene Ther. 4:151-9; Davis, H.L., et al., 1993, 
Hum. Gene Ther. 4:733-40]. 
5 In comparison, immunoreactivity for p-gal was observed 

in a wide area of muscle tissue after intramuscular 
injection of CMV-p-gal plasmid/PVP complex (1:17 w/w) in 150 
mM NaCl. It appeared that the majority of positive muscle 
fibers were located at the edge of muscle bundles. Thus, 

10 staining for p-gal in rat muscle demonstrated that, using a 
plasmid/PVP complex, the number of muscle fibers stained 
positive for P-gal was approximately 8-fold greater than 
found using a saline formulation. Positively stained 

muscle fibers were also observed over a much larger area in 

15 the muscle tissue using the plasmid/PVP complex providing 
evidence that the injected plasmid was widely dispersed 
after intramuscular injection. 

We conclude that the enhanced plasmid distribution and 
expression in rat skeletal muscle was a result of both 

20 protection from extracellular nuclease degradation due to 
complexation and hyper-osmotic effects of . the plasmid/PVP 
complex. However, Dowty and Wolff et al. have demonstrated 
that osmolarity, up to twice physiologic osmolarity, did not 
significantly effect gene expression in muscle [Dowty, M.E., 

25 and Wolff, J. A. In: J. A. Wolff (Ed.), 1994, Gene 

Therapeutics: Methods and Applications of Direct Gene 
Transfer. Birkhauser, Boston, pp. 82-98]. This suggests 
that the enhanced expression of plasmid due to PVP 
complexation is most likely due to nuclease protection and 

30 less to osmotic effects. Further, the surface modification 
of plasmids by PVP (e.g., increased hydrophobicity and 
decreased negative surface charge) may also facilitate the 
uptake of plasmids by muscle cells. 
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3 . Structure-activity relationship of PINC 
polymers 

We have found a linear relationship between the 
structure of a series of co-polymers of vinyl pyrrolidone 
5 and vinyl acetate and the levels of gene expression in rat 
muscle. We have found that the substitution of some vinyl 
pyrrolidone monomers with vinyl acetate monomers in PVP 
resulted in a co-polymer with reduced ability to form 
hydrogen bonds with plasmids. The reduced interaction 
10 subsequently led to decreased levels of gene expression in 
rat muscle after intramuscular injection. The expression of 
p-gal decreased linearly (R = 0.97) as the extent of vinyl 
pyrrolidone monomer (VPM) content in the co-polymers 
decreased. 

15 These data demonstrate that pH and viscosity are not 

the most important parameters effecting delivery of plasmid 
to muscle cells since these values were equivalent for all 
complexes. These data suggest that enhanced binding of the 
PINC polymers to plasmid results in increased protection and 

20 bioavailability of plasmid in muscle. 

4 . Additional PINC systems 

The structure-activity relationship described above can 
be used to design novel co-polymers that will also have 
enhanced interaction with plasmids. It is expected that 

25 there is w an interactive window of opportunity" whereby 
enhanced binding affinity of the PINC systems will result in 
a further enhancement of gene expression after their 
intramuscular injection due to more extensive protection of 
plasmids from nuclease degradation. It is expected that 

30 there will be an optimal interaction beyond which either 
condensation of plasmids will occur or "triplex" type 
formation, either of which can result in decreased 
bioavailability in muscle and consequently reduced gene 
expression. 
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As indicated above, the PINC compounds are generally 
amphiphilic compounds having both a hydrophobic portion and 
a hydrophilic portion. In many cases the hydrophilic 
portion is provided by a polar group. it is recognized in 
5 the art that such polar groups can be provided by groups 
such as, but not limited to, pyrrolidone, alcohol, acetate, 
amine or heterocyclic groups such as those shown on pp. 2-73 
and 2-74 of CRC Handbook of Chemistry and Physics (72nd 
Edition), David R. Lide, editor, including pyrroles, 
10 pyrazoles, imidazoles, triazoles, dithiols, oxazoles, 
(iso)thiazoles, oxadiazoles, oxatriazoles , diaoxazoles, 
oxathioles, pyrones, dioxins, pyridines, pyridazines,' 
pyrimidines, pyrazines, piperazines, (iso) oxazines, indoles,' 
indazoles, carpazoles, and purines and derivatives of these 
15 groups, hereby incorporated by reference. 

Compounds also contain hydrophobic groups which, in the 
case of a polymer, are typically contained in the backbone 
of the molecule, but which may also be part of a non- 
polymeric molecule. Examples of such hydrophobic backbone 
20 groups include, but are not limited to, vinyls, ethyls, 
acrylates, acrylamides, esters, celluloses, amides, 
hydrides, ethers, carbonates, phosphazenes , sulfones, 
propylenes, and derivatives of these groups. The polarity 
characteristics of various groups are quite well known to 
25 those skilled in the art as illustrated, for example, by 
discussions of polarity in any introductory organic 
chemistry textbook. 

The ability of such molecules to interact with nucleic 
acids is also understood by those skilled in the art, and 
can be predicted by the use of computer programs which model 
such intermolecular interactions. Alternatively or in 

addition to such modeling, effective compounds can readily 
be identified using one or more of such tests as 1) 
determination of inhibition of the rate of nuclease 
digestion, 2) alteration of the zeta potential of the DNA, 
which indicates coating of DNA, 3) or inhibition of the 
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ability of intercalating agents, such as ethidium bromide to 
intercalate with DNA. 

5 . Targeting Ligands 

In addition to the nucleic acid/PINC complexes 
described above for delivery and expression of nucleic acid 
sequences, in particular embodiments it is also useful to 
provide a targeting ligand in order to preferentially obtain 
expression in particular tissues, cells, or cellular regions 
or compartments. 

Such a targeted PINC complex includes a PINC system 
{monomer ic or polymeric PINC compound) complexed to plasmid 
(or other nucleic acid molecule). The PINC system is 
covalently or non-covalently attached to (bound to) a 
targeting ligand (TL) which binds to receptors having an 
affinity for the ligand. Such receptors may be on the 
surface or within compartments of a cell. Such targeting 
provides enhanced uptake or intracellular trafficking of the 
nucleic acid. 

The targeting ligand may include, but is not limited 
to, galactosyl residues, fucosal residues, mannosyl 
residues, carnitine derivatives, monoclonal antibodies, 
polyclonal antibodies, peptide ligands, and DNA-binding 
proteins. Examples of cells which may usefully be targeted 
include, but are not limited to, antigen-presenting cells, 
hepatocytes, myocytes, epithelial cells, endothelial cells, 
and cancer cells. 

Formation of such a targeted complex is illustrated by 
the following example of covalently attached targeting 
ligand (TL) to, PINC system: 

TL-PINC + Plasmid > TL-PINC: : : : : : Plasmid 

Formation of such a targeted complex is also 
illustrated by the following example of non-covalently 
attached targeting ligand (TL) to PINC system 

TL::::::PINC + Plasmid > TL: ::::: PINC :::::: Plasmid 

or alternatively, 
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PINC + Plasmid > PTwr D1 

f INC. :::::: Plasraxd + TL 

> TL: : : : : : PINC: :::::: Plasmid 

In these examples :::::::: is non-covalent interaction such 
as ionic, hydrogen-bonding, Van der Waals interaction, 
5 hydrophobic interaction, or combinations of such 
interactions . 

A targeting method for cytotoxic agents is described in 
Subramanian et al.. International Application No 
PCT/US96/08852, International Publication No. WO 96/39124' 
10 hereby incorporated by reference. This application 

describes the use of poller affinity systems for targeting 
cytotoxic materials using a two-step targeting method 
involving zip polymers, i.e., pairs of interacting polymers 
An antibody attached to one of the interacting pollers 
15 binds to a cellular target. That polymer then acts as a 
target for a second polymer attached to a cytotoxic agent 
As referenced in Subramanian et al., other two-step (or 
multi-step) systems for delivery of toxic agents are also 
described. 

In another aspect, nucleic acid coding sequences can be 
delivered and expressed using a two-step targeting approach 
involving a non-natural target for a PINC system or PINC- 
targeting ligand complex. Thus, for example, a PINC-plasmid 
complex can target a binding pair member which is itself 
attached to a ligand which binds to a cellular target (e g 
a MAB) . Binding pairs for certain of the compounds 

identified herein as PINC compounds as identified in 
Subramanian et al. Alternatively, the PINC can be complexed 
to a tareting ligand, such as an antibody. That antibody 
can be targeted to a non-natural target which binds to, for 
example, a second antibody. 
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111 • M ° del s V stem s for Evalu ation of Interferon Alp h* 
Constructs and Formulations 

In accord with the concept of using interferon alpha 
35 expressing plasmid constructs and formulations in anti- 
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cancer treatment, murine model systems were utilized based 
on murine tumor cell lines. The line primarily used was 
S.C. VII/SF, which is a cell line derived from murine 
squamous cell carcinoma (S.C). 

Squamous cell carcinoma of the head and neck begins 
with the cells lining the oral and pharyngeal cavities. 
Clinical disease progresses via infiltration and spreads 
into the underlying tissues and lymphatics. The 
undifferentiated, in vivo passage tumor line S.C. VII/SF 
displays this typical growth pattern. In addition, its 
rapid growth rate provides a relatively short test period 
for individual experiments. Other murine tumor cell lines 
include another SCC line KLN-205, a keratinocyte line 1-7, 
and a colon adenocarcinoma line MC-38. 

An optimal model system preferably satisfies the 
criteria based on having tumor growth rate in vivo (i.e., 
tumors are ready for treatment in 4-10 days post implant), 
invasiveness, and local spread similar to those observed in 
clinical disease, and providing accessibility for 
experimental treatment. As indicated, the SCC VII/SF cell 
line was utilized as the primary model system cell line. 
This cell line typically grows rapidly, resulting in death 
of untreated syngeneic mice 14-17 ; days after tumor cell 
implantation. 

This cell line can be utilized in a variety of ways to 
provide model system suitable for a variety of different 
tests. Four such possibilities are described below. 

First, SCCVII cells can be utilized in cell culture to 
provide an in vitro evaluation of interferon alpha 
expression construct and formulation characteristics, such 
as expression levels and cellular toxicities. . 

Second, the cells can be implanted subcutaneously in 
mice. This system can be utilized in tests in which 
accessibility of the implant site is beneficial. As an 
example, the method was utilized in evaluations of 
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expression of 


:ranscutaneously into 


expression efficiencies based on the 
chloramphenicol acetyltransf erase (CAT) . 

Third, the cells can be implanted ti 
the fascia of digastric muscle. 

Fourth, the cells can be implanted transcutaneous^ 
into digrastric/mylohyoid muscles. The important features 
of models 3 and 4 are shown in the table below. 


TABLE I I; Comparison of submandibular tumor mode 1 < 


Feature 

Mouse Tumor Model 3 

Mouse Tumor Model 4 

Tumor implant 
procedure 

2-4 x 10 5 cells 
transcutaneously into 
fascia of digastric 
muscle 

5 x 10 5 

transcutaneously into 

digastric/mylohyoid 

muscles 

Tumor growth and 

invasiveness 

characteristics 

Prominent 

submandibular bulge; 
invasion of 
digastric/mylohyoid 
muscles and 
lymphatics 

More variable, 
invasion of 
digastric/mylohyoid 
muscles and lymphatics 

Treatment 
procedure 
{primary 
treatment) 

Transcutaneous , 
needle inserted and 
moved within tumor to 
produce a 4 quadrant 
distribution of gene 
medicine 

Lower jaw skin flap 
raised to expose 
tumor, needle inserted 
and moved within tumor 
to produce a 4 
quadrant distribution 
of gene medicine 

Days treated 
(post-implant) 

Day 5, day 10 (both 
transcutaneously) 

Day 5 (tumor exposed) , 
day 8 

(transcutaneously) 

Measurement 
procedure 

External calipering 
2-3 x per week until 
death 

First caliper when 
tumor exposed for 
treatment, second 
caliper at sacrifice 
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Feature 

Mouse Tumor Model 3 

Mouse Tumor Model 4 

Advantages 

Non-surgical , closed 
model allows larger 
experiments and more 
frequent treatments ; 
Sacrifice unnecessary 
to caliper (=more 
time points) 

Surgical, open model 
allows direct 
treatment of exposed 
tumor; Local 
inflammation from 
surgery may 

additionally stimulate 
immune response; More 
like clinical 
situation for protocol 
development 

Disadvantages 

Transcutaneous 
treatment is 
potentially less 
accurate and 
intensive; less like 
expected clinical 
treatments than 
surgical approaches 

Labor intensive; 
Smaller, fewer 
experiments possible ; 
Tumors deeper and more 
difficult to treat 
transcutaneously (for 
secondary treatments); 
Fewer treatments and 
calipe rings possible 


The tumor size treated in the mouse models is generally 
20-50 mm 3 . A 50 mm 3 mouse tumor is approximately equivalent 
to 150 cc 3 human tumor having an average diameter of about 
6.6 cm. This tumor size is approximately 10-fold larger 


5 than the size proposed to be treated in the phase I clinical 
trials. This indicates that the mouse models are strongly 
. biased towards over estimating the expected tumor burden in 
human patients. 

Iv * Formulations for In Vivo Delivery 
10 A. General 

While expression systems such as those described above 
provide the potential for expression when delivered to an 
appropriate location, it is beneficial to provide the 
expression system construct (s) in a delivery system which 
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can assist both the delivery and the cellular uptake of the 
construct. Thus, this invention also provides particular 
formulations which include one or more expression system 
constructs (e.g., DNA plasmids as described above), and a 
5 protective, interactive non-condensing compound. 

An additional significant factor relating to the 
plasmid construct is the percentage of plasmids which are in 
a supercoiled (SC) form rather than the open circular (OC) 
form. 

10 B. Delivery and Expression 

A variety of delivery methods can be used with the 
constructs and formulations described above, in particular, 
delivery by injection to the site of a tumor can be used. 
The submandibular tumor models utilized injection into four 

15 quadrants of the tumor being treated. 

C. Anti-Cancer Efficacy of Human Interferon Alpha 
Formulations 

The effects of the administration of the interferon 
alpha formulations described above were evaluated using the 
20 S.C. VII mouse tumor models. Plasmid constructs as 

described above were incorporated in delivery formulations. 
The formulations were delivered by injection. 

D. Synergistic Effects of Interferon Alpha plasmid 
and IL-12 Plasmid and Effect of Human Interferon 

25 Alpha Formulation Administration on Production of 

Secondary Cytokines 
The effects of the expression of the human interferon 
alpha plasmids in tumor cells on the progress of the mouse 
tumors demonstrates that such interferon alpha expression is 
30 effective against such tumors. However, it was also shown 
that IL-12 can act synergistically with the interferon alpha 
expression to exercise the antitumor effect (see Figure 9) . 
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E. Toxicity Evaluation of Exemplary Formulations 
The exemplary formulations do not show high cellular 
toxicity at the concentrations tested, suggesting that the 
formulations do not significantly kill cells by direct toxic 
action in vivo. Moreover, the anti-tumor activity induced 
by IFNot gene therapy is dependent upon activation of the 
immune system, which is demonstrated by depletion studies in 
vivo. Removal of a specific T lymphocyte population (CD8 + ) 
abrogates the anti-tumor activity elicited by IFNcc gene 
therapy. 

V. Administration 

Administration as used herein refers to the route of 
introduction of a plasmid or carrier of DNA into the body. 
In addition to the methods of delivery described above, the 
expression systems constructs and the delivery system 
formulations can be administered by a variety of different 
methods. 

Administration can be directly to a target tissue -or by 
targeted delivery to the target tissue after systemic 
administration. In particular, the present invention can be 
used for treating disease by administration of the 
expression system or formulation to the body in order to 
establishing controlled expression of any specific nucleic 
acid sequence within tissues at certain levels that are 
useful for gene therapy. 

The preferred means for administration of vector 
(plasmid) and use of formulations for delivery are described 
above. The preferred embodiments are by direct injection 
using needle injection. 

The route of administration of any selected vector 
construct will depend on the particular use for the expres- 
sion vectors. In general, a specific formulation for each 
vector construct used will focus on vector uptake with 
regard to the particular targeted tissue, followed by 
demonstration of efficacy. Uptake studies will include 
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uptake assays to evaluate cellular uptake of the vectors and 
expression of the DNA of choice. Such assays will also 
determine the localization of the target DNA after uptake, 
and establishing the requirements for maintenance of steady- 
state concentrations of expressed protein. Efficacy and 
cytotoxicity can then be tested. Toxicity will not only 
include cell viability but also cell function. 

Muscle cells have the unique ability to take up DNA 
from the extracellular space after simple injection of DNA 
particles as a solution, suspension, or colloid into the 
muscle. Expression of DNA by this method - can be sustained 
for several months. 

Delivery of formulated DNA vectors involves 
incorporating DNA into macromolecular complexes that undergo 
endocytosis by the target cell. Such complexes may include 
lipids, proteins, carbohydrates, synthetic organic 
compounds, or inorganic compounds. Preferably, the complex 
includes DNA, a cationic lipid, and a neutral lipid in 
particular proportions. The characteristics of the complex 
formed with the vector (size, charge, surface character- 
istics, composition) determines the bioavailability of the 
vector within the body. Other elements of the formulation 
function as ligand which interact with specific receptors on 
the surface or interior of the cell. Other elements of the 
formulation function to enhance entry into the cell, release 
from the endosome, and entry into the nucleus. 

Delivery can also be through use of DNA transporters. 
DNA transporters refers to molecules which bind to DNA 
vectors and are capable of being taken up by epidermal 
cells. DNA transporters contain a molecular complex capable 
of noncovalently binding to DNA and efficiently transporting 
the DNA through the cell membrane. It is preferable that 
the transporter also transport the DNA through the nuclear 
membrane. See, e.g., the following applications all of 
which (including drawings) are hereby incorporated by 
reference herein: (1) Woo et al., U.S. Serial No. 
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07/855,389, entitled "A DNA Transporter System and Method of 
Use,, filed March 20, 1992, now abandoned; (2) Woo et al., 
PCT/US93/02725, International Publ . W093/18759, entitled M A 
DNA Transporter System and Method of Use", (designating the 
U.S. and other countries) filed March 19, 1993; (3) 
continuation-in-part application by Woo et al . , entitled 
"Nucleic Acid Transporter Systems and Methods of Use", filed 
December 14, 1993, U.S. Serial No. 08/167,641; (4) Szoka et 
al. , U.S. Serial No. 07/913,669, entitled "Self-Assembling 
Polynucleotide Delivery System", filed July 14, 1992 and (5) 
Szoka et al., PCT/US93/03406, International , Publ . W093/19768 
entitled "Self-Assembling Polynucleotide Delivery System", 
(designating the U.S. and other countries) filed April 5, 
1993. A DNA transporter system can consist of particles 
containing several elements that are independently and non- 
covalently bound to DNA. Each element consists of a ligand 
which recognizes specific receptors or other functional 
groups such as a protein complexed with a cationic group 
that binds to DNA. Examples of cations which may be used 
are spermine, spermine derivatives, histone, cationic 
peptides and/or polylysine. One element is capable of 
binding both to the DNA vector and to a cell surface 
receptor on the target cell. Examples of such elements are 
organic compounds which interact with the asialoglycoprotein 
receptor, the folate receptor, the mannose-6-phosphate 
receptor, or the carnitine receptor. A second element is 
capable of binding both to the DNA vector and to a receptor 
on the nuclear membrane. The nuclear ligand is capable of 
recognizing and transporting a transporter system through a 
nuclear membrane. An example of such ligand is the nuclear 
targeting sequence from SV40 large T antigen or histone. A 
third element is capable of binding to both the DNA vector 
and to elements which induce episomal lysis. Examples 
include inactivated virus particles such as adenovirus, 
peptides related to influenza virus hemagglutinin, or the 
GALA peptide described in the Szoka patent cited above. 
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Transfer of genes directly into a tumor has been very 
effective. Experiments show that administration by direct 
injection of DNA into tumor cells results in expression of 
the gene in the area of injection. Injection of plasmids 
containing human interferon alpha results in expression of 
the gene for 5 days following a single intra-tumoral 
infection. Human IFNa production was highest in tumors 
harvested 1 day post-tumor injection and steadily declined 
thereafter. The injected DNA appears to persist in an 
unintegrated extrachromosomal state. This means of transfer 
is a preferred embodiment. 

Administration may also involve lipids as described in 
preferred embodiments above. The lipids may form liposomes 
which are hollow spherical vesicles composed of lipids 
arranged in unilamellar, bilamellar, or multilamellar 
fashxon and an internal aqueous space for entrapping water 
soluble compounds, such as DNA, ranging in size from 0.05 to 
several microns in diameter. Lipids may be useful without 
formxng liposomes. Specific examples include the use of 
cationic lipids and complexes containing DOPE which interact 
with DNA and with the membrane of the target cell to 
facilitate entry of DNA into the cell. 

Gene delivery can also be performed by transplanting 
genetically engineered cells. For example, immature muscle 
cells called myoblasts may be used to carry genes into the 
muscle fibers. Myoblast genetically engineered to express 
recombinant human growth hormone can secrete the growth 
hormone into the animal's blood. Secretion of the incorpor- 
ated gene can be sustained over periods up to 3 months. 

Myoblasts eventually differentiate and fuse to existing 
muscle tissue. Because the cell is incorporated into an 
existing structure, it is not just tolerated but nurtured 
Myoblasts can easily be obtained by taking muscle tissue 
from an individual who needs gene therapy and the 
genetically engineered cells can also be easily put back 
with out causing damage to the patient's muscle. Similarly, 
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keratinocytes may be used to delivery genes to tissues. 
Large numbers of keratinocytes can be generated by 
cultivation of a small biopsy. The cultures can be prepared 
as stratified sheets and when grafted to humans, generate 
epidermis which continues to improve in histotypic quality 
over many years. The keratinocytes are genetically 

engineered while in culture by transfecting the 
keratinocytes with the appropriate vector. Although 
keratinocytes are separated from the circulation by the 
basement membrane dividing the epidermis from the dermis, 
human keratinocytes secrete into circulation the protein 
produced . 

The chosen method of delivery should result in 
expression of the gene product encoded within the nucleic 
acid cassette at levels which exert an appropriate 
biological effect. The rate of expression will depend upon 
the disease, the pharmacokinetics of the vector and gene 
product, and the route of administration, but should be in 
the range 0.001-100 mg/kg of body weight /day, and 
preferably 0.01-10 mg/kg of body weight/day. This level is 
readily determinable by standard methods. It could be more 
or less depending on the optimal dosing. The duration of 
treatment will extend through the course of the disease 
symptoms, possibly continuously. The number of doses will 
depend upon the disease, delivery vehicle, and efficacy data 
from clinical trials. 

Examples 

The present invention will be more fully described in 
conjunction with the following specific examples which are 
not to be construed in any way as limiting the scope of the 
invention. As shown below, ml FN- gene medicine reduces the 
growth of tumors in syngeneic murine tumor models. Lipid 
formulations of ml FN- gene medicine display anti tumor 
activity in both SCC-VII and MC-38 tumor models. PINC and 
peptide formulations of ml FN- gene medicine display anti 
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tumor effects in the MC-38 tumor model. The anti tumor 
effects of ml FN— gene medicine are dose dependent. m 
addition, the examples demonstrate that treatment of tumors 
with the combination of IFNa and IL-12 gives an 
unanticipated more than additive (synergystic) anti-tumor 
activity using either a PINC or a lipid formulation. 

Example 1 

A plasmid expression system encoding murine IFNa4 and 
formulated in a polymeric delivery system was used for in 
vivo immunotherapeutic activity against - an immunogenic 
murine renal cell carcinoma, Renca, and a non-immunogenic 
mammary adenocarcinoma, TS/A. Mice bearing established 
tumors were treated with IFNa/polyvinyl-pyrrolidone (PVP) 
expression complexes via direct intra-tumoral injection. Up 
to 100 % tumor growth inhibition was observed in the treated 
mice. By using an optimal dose of 96 and 48 ng of 
formulated IFN-ct plasmid for the treatment of Renca and TS/A 
respectively, 30% (Renca) and 10% (TS/A) of the treated 
animals remained tumor-free. Tumor inhibition was dependent 
upon activation of the immune system. The anti-tumor 
activity elicited by I FN -a gene therapy was abrogated when 
mice were selectively depleted of CD8* T cells. By contrast 
removal of CD4 + resulted in increased tumor rejection 
following IFN-a/PVP treatments. Finally, mice that remained 
tumor-free following I FN— a gene therapy displayed immune 
resistance to a subsequent challenge of tumor. These data 
provide evidence that non-viral IFNa gene therapy can be 
used to induce an efficient anti-tumor response. 

Local presence of cytokines in tumors can activate an 
immune response that in some cases leads to induction of 
specific long-lasting anti-tumor immunity. By direct intra- 
tumoral injection of plasmid encoding murine IFN«4 and 
formulated in a polymeric delivery system, tumor-bearing 
mice develop an immune response, which leads to inhibition 
and eradication of the tumor. We have shown by depletion 
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studies in vivo that the immune response induced by IFNa is 
mainly CD8-mediated and that this treatment results in a 
long-term immunity in mice demonstrating complete tumor 
regression. Thus, non-viral IFNa gene therapy may be an 
effective alternative to IFNa protein therapy for human 
cancers . 

Transduction of tumor cells with cytokine genes has 
proven to be a very efficient technique to induce cytokine- 
mediated anti-tumor immunity. In experimental models, the 
local presence of IL-2, IL-1, IL-4, IL-6, IL-7, IL-12, IFNs 
and CSFs (i.e., GM-CSF) at the site of the tumor can result 
in significant tumor growth inhibition (Colombo et al . , 
"Local Cytokine Availability Elicits Tumor Rejection and 
Systemic Immunity Through Granulocyte-T-Lymphocyte Cross- 
Talk", Cancer Research , 52, 4853-4857 (1992)). In these 
systems, cytokines have limited effect on tumor 
proliferation directly but are capable of activating a rapid 
and potent anti-tumor immune response, which impedes tumor 
progression. Established parental tumors, however, are 
difficult to eradicate with ex vivo cytokine-transduced 
tumor cells because efficacy of vaccination is highly 
dependent on the size, growth rate and invasiveness of the 
tumor . 

To overcome these problems, cytokine-based gene therapy 
approaches, which can deliver transgenic cytokines locally 
and induce an anti-tumor immune response, have been recently 
evaluated by a number of investigators (Forni et al., 
^Cytokine-Induced Immunogenicity : From Exogenous Cytokines 
to Gene Therapy", Journal of Immunotherapy , 14, 253-257, 
(1993); Pericle et al., "An Efficient Th2-type Memory 
Follows Cd8+ Lymphocyte-driven and Eosinophil-mediated 
Rejection of a Spontaneous Mouse Mamary Adenocarcicoma 
Engineered to Release 11-4", The Journal of Immunology , 153, 
5660-5673. (1994); Pardoll et al., "Gene Modified Tumor 
Vaccines, In Cytokine-Induced Tumor Immunogenicity", eds . 
• Academic Press, London, p. 71-86. (1994); and Musiani et 


WO 99/47678 


PCT/US99/05394 


76 


10 


25 


30 


al., "Cytokines, Tumor-cell Death and Immunogenic! ty : A 
Question of Choice", Immunology Today . 1, 32-36 (1997)). 
Technological breakthroughs in gene therapy using 
adenoviral, retroviral, and liposomal vectors have provided 
powerful tools with which to study the biological effects of 
specific cytokine mediators as well as to develop novel and 
clinically applicable anti-tumor immunotherapies (Pardoll, 
"Paracrine Cytokine Adjuvants in Cancer Immunotherapy"' 

5ev. Immunol. 13, 399-415 (1995); Bramson et al., 

"Direct Intratumoral Injection of an Adenovirus Expressing 
Interleukin-12 Induces Regression and Long-Blasting Immunity 
That Is Associated with Highly Localized Expression of 
Interleukin-12", Hum. Gene Ther .. 7, 1995-2002 (1996); Rao 
et al., "11-12 Is an Effective Adjuvant to Recombinant 
15 Vaccinia Virus-based Tumor Vaccines", J. Immunol . 156, 
3357-3365. 1996; Rakhmilevich et al., -Gene Gun-mediated 
Skin Transfection with Interleukin 12 Gene Results in 
Regression of Established Primary and Metastatic Murine 
Tumors", Proc. Natl. Acad. Sci . USA . 93, 6291-6296 (1996); 
20 and Rakhmilevich et al, "Cytokine Gene Therapy of Cancer 
Using Gene Gun Technology: Superior Antitumor Activity of 
Interleukin-12", Hum. Gene Ther . 8, 1303-1311, (1997)). 

A gene therapy approach utilizing an interactive 
polymeric gene delivery system that increases protein 
expression by protecting plasmid DNA (pDNA) from nucleases 
and controlling the dispersion and retention of pDNA in 
muscle cells is described in Mumper et al., 1996. These 
polymeric interactive non-condensing (PINC) systems 
routinely result in a greater amount of gene expression from 
tissues as compared to delivery of unformulated plasmid in 
saline (Mumper et al., 1996). By using a plasmid that 
encodes human insulin growth factor-1 (hIGF-1) and 
formulated as a PINC complex, production of biologically 
active h IGF-1 in vivo following intra-muscular injection 
35 has been shown (Alila et al., "Expression of Biologically 
Active Human Insulin-Like Growth Factor-1 Following 
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Intramuscular Injection of a Formlated Plasmid in Rats", 
Human Gene Therapy , 8, 1785-1795 (1997)). The specific 
objective of this study was to determine whether a plasmid 
expression system encoding murine IFNa4 and formulated as a 
complex with PVP could induce an anti-tumor immune response 
following direct injection into subcutaneous murine tumors. 

The I FN family consists of three major glycoproteins, 
IFNcc, IFNP and IFNy. Although IFNs were first developed as 
antiviral agents, it is now clear that they also control 
cell growth and differentiation, and modulate various 
aspects of host immunity (Gresser et al., "Antitumor effects 
of interferon", Acta Oncol . 28, 347-353 (1989)). Clinical 
data concluded that systemic chronic administration, of IFNct 
could produce regression of vascular tumors, including 
Kaposi's sarcoma, pulmonary hemangiomastosis , and 
hemangiomas (Singh et al., "Interferons A and B Down- 
regulate the Expression of Basic Fibroblast Growth Factor in 
Human Carcicomas", Proc. Natl. Acad. Sci . USA . 92, 4562-4566 
(1995)). Although IFNcc was the first cytokine to be used in 
clinical trials that proved to be effective against certain 
types of human cancer, only recently has this cytokine been 
considered as a candidate for gene therapy (Ogura et al. 
1993, Belldegrun et al., "Human Renal Carcinoma Line 
Transfected With Interleukin-2 and/or Interferon a Gene(s): 

Implications for Live Cancer Vaccines, Journal of the 

National Cancer Institute , 85, 207-216 (1993). 

Initial studies have shown that the injection of 
genetically modified tumor cells producing IFNct into 
syngeneic mice induces tumor growth inhibition and elicits a 
tumor-specific immune memory (Ferrantini et al., Interferon 
Alpha-l-Interferon Gene Transfer into Metastatic Friend 
Lukemia Cells Abrogated Tumorigenicity in Immunocompetent 
Mice: Antitumor Therapy by Means of Interf eron-Producing 
Cells; Cancer Res . 53, 1107-4615 (1993); Ferrantini . et al., 
"Ifn-al Gene Expression into a Metastatic Murine 
Adenocarcicoma (Ts/a) Results in Cd8 + T Cell-Mediated Tumor 
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Rejection and Development of Antitumor Immunity: Comparative 
Studies with Ifn-y-producing Ts/a Cells- Journal of 
Immunology, 153, 4604-4615, (1994); Musiani et al. 1997). 
However, the real value of this potential form of vaccine in 
inducing the regression of established tumors remains to be 
demonstrated . 

In this study we present evidence that direct injection 
of IFNa plasmid formulated in PVP into subcutaneous murine 
tumors results in a host-dependent tumor rejection, 
primarily mediated by CD8 + T cells, and elicits a protective 
immunity against subsequent tumor re-challenge. 

Materials And Methods 

Plasmid co nstruction and formulation 

A plasmid expression system containing an expression 
15 cassette for mIFN-la4 was constructed as follows. The coding 
sequence of the murine IFN-ot4 gene (Genebank X01973 M15456 M23830 
X01967) was amplified by PGR from mouse genomic DNA. The 
amplified mIFN-a4 sequence was then subcloned into a plasmid 
backbone, and the sequence fidelity was verified by DNA sequence 
analysis (data not shown) . The coding sequence for mIFN-a4 was 
then subcloned as an Xba I -BamHl fragment into the expression 
plasmid pIL0697 to create the mIFN-a4 expression system pIF0836. 
Plasmid DVC0612 (empty plasmid, EP) contains expression elements 
including the cytomegalovirus immediate early promoter and the 3' 
25 UTR/poly(A) signal from the bovine growth gene in the pVC0289 
backbone described by Alila et al. (1997). Plasmid pVC0612 was 

used as a control plasmid in all in vivo experiments. Plasmids 
for intra-tumoral injection were grown under kanamycin selection 
in E. coli host strains DH5a and purified using conventional 
30 alkaline lysis and chromatographic methods. Purified plasmid 
utilized for intra-tumoral injections had the following 
specifications: endotoxin (< 500 Eu/mg plasmid); protein (< 1%); 
and chromosomal DNA « 20 %) . Purified pIF0836 and control 
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plasmids were formulated at a concentration of 3 mg DNA/ml 
solution of 5 % w/v polyvinylpyrrolidone (Plasdone C-30, 
Technologies, Wayne, NJ) , 150 mM NaCl on the day of injectioi 
described previously (Mumper et al., 1996). 


Western blot analysis and bioass av for mlFNa. 

HeLa cells were plated in 6 well plates at 3 x 10 5 cells 
per well, and transfected using 1 a^g of mouse IFNa4 plasmid 
P IF0836C and 3 ug of Lipof ectamine (Life Technologies, Inc., 
Gaithersburg, MD) in serum-free DMEM. Same supernatants 
were harvested 24 hours later and immunoprecipitated using 
anti-mouse interferon a/p polyclonal antibody (BioSource 
International, Camarillo, CA) and protein A and G agarose 
(Boehringer Mannheim, Indianapolis, IN) . Samples were run 
on a 12% Tris-glycine gel and electroblotted to Millipore 
PVDF membrane. Anti-mouse interferon a/p polyclonal antibody 
was used at 1:1000, followed by anti-sheep Ig HRP 
(Boehringer Mannheim) at 1:1000. Biotinylated molecular 
weight markers were detected using Streptavidin-HRP 
(Amersham, Arlington Heights, ID . Detection was performed 
using the Amersham ECL kit. Supernatants were also tested 
for IFNcc biological activity using L929 cells treated with 
encephalomyocarditis virus, in parallel with a NIH mouse 
IFNa reference reagent (Access Biomedical, San Diego, CA) . 

Animals 

Normal 8-week-old female BALB/c mice were purchased 
from Harlan Laboratories, Houston, TX. Mice were maintained 
on ad libitum rodent feed and water at 23° C, 40% humidity, 
and a 12-h/12-h light-dark cycle. Animals were acclimated 
for at least 4 days before the start of the study. 

Tumors 

Three established mouse tumor models were used in this 
study. TS/A is a tumor cell line established by Dr. P. 
Nanni, University of Bologna, Italy, from the first in vivo 
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transplant of a moderately differentiated mammary 
adenocarcinoma that spontaneously arose in a BALB/c mouse 
(Nanni et al., 1983). A number of pre-immuniza t ion- 

challenge experiments suggested that TS/A does not elicit a 
long-lasting anti-tumor immunity (Forni et al . , 1987). TS/A 
was generously provided by Dr. Guide Forni, University of 
Turin, Italy. Renca, a spontaneously arising murine renal 
cell carcinoma, and CT-26, a colon adenocarcinoma, were 
generously provided by Dr. Drew M . Pardoll, John Hopkins 
Hospital, Baltimore, MD. Tumor cell cultures were 

maintained in sterile disposable flasks* from Corning 
(Corning, NY) at 37° C in a humidified 5% C0 2 atmosphere, 
using RPMI 1640 supplemented with 10% FBS , 100 U/ml 
penicillin, 100 U/ml streptomycin and 50 ug/ml gentamycin; 
all from Life Technologies. 

In vivo evalua tion of tumor growth and treatments 

BALB/c mice were challenged s.c. in the middle of the 
left flank with 30 ul of a single-cell suspension contained 
the specified number of cells. Seven days later when the 
tumor size reached approximately 10 mm 3 , treatments with 
IFNa/PVP or EP/PVP started and were repeated at 1-2 day 
intervals for 2 weeks (total of 8 treatments: 4/week). 
Tumor volume was measured with electronic caliper in the two 
perpendicular diameters and in the depth. Measurements of 
the tumor masses (mm 3 ) were performed twice a week for 4 0-50 
days. All mice bearing tumor masses exceeding 1 cm3 volume 
were sacrificed for humane reasons. When depletion of 
immunocompetent cells in vivo was required, a group of mice 
received i.v 0.5 ml of a-CD4 (GK1.5 hybridoma, 207-TIB, 
ATCC, Rockville, MD) ascite (1:10), or a-CD8 (2.43 
hybridoma, 210-TIB, ATCC) ascite (1:100) or i.p. 100 ug 
a-GRl (RB6-8C5 hybridoma, Pharmingen, San Diego, CA) . 
Control mice received i.v. 0.5 ml isotype control IgG 
(Pharmingen). Antibody treatments were performed twice: 
first injection 1 day before starting the gene therapy 
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treatment and the second injection (i.p at the same dosage) 
7 days later. 

CTL assay 

A standard 6-hour 51-chromium ( 51 Cr) -release' assay was 
5 performed following 5 days of in vitro effector cell 
stimulation. Single cell suspensions of splenocytes were 
prepared 3 weeks following tumor challenge by mashing the 
spleens in RPMI 164 0 medium (Life Technologies) and passing 
the cells through 70 um nylon mesh cell strainers (Falcon, 

10 Becton Dickinson, Lincoln Park, NJ) into 50ml centrifuge 
tubes (Falcon) . After centrif ugation, red blood cells were 
lysed with ACK Lysing Buffer (Biofluids, Inc., Rockville, 
MD) and the splenocytes washed twice with RPMI . In vitro 
stimulation cultures contained 3 X 10 6 splenocytes/ef f ectors 

15 per ml with 6 X 10 5 mitomycin-C-treated Renca/stimulator 
cells per ml and 10 Units per ml recombinant murine IL-2 
(Genzyme, Cambridge, MA) in RPMI containing 10% FBS, 22mM 
HEPES buffer (Research Organics Inc., Cleveland OH), Penn- 
Streptomycin, 5 X 10" 5 M 2-(3-mercapto-ethanol (Life 

20 Technologies), OPI media supplement (Sigma, St. Louis, MO), 
and essential and non-essential amino acids (Life 
Technologies) (for a 5 : 1 responder : stimulator ratio) . 
Stimulators were prepared by incubating Renca cells at 3 X 
10 7 per ml in RPMI with 30 \ig per ml mitomycin-C (Sigma) at 

25 37° C for 60 minutes, followed by four washes in HBSS with 
2.5% FBS. After 5 days at 37° C, effector cells were 
pelleted, resuspended in complete RPMI, counted, and mixed 
with 51 Cr -labeled targets in a 96 well V-bottomed plate 
(Costar/Corning, Cambridge, MA). Renca and CT26 targets 

30 were labeled by incubating them at 2 X 10 6 cells per ml in 
complete RPMI with 150 uCi 51 Cr (Amersham) for 2.5 hours. 
Targets were washed 3 times in HBSS with 2.5% FBS and 
resuspended in complete RPMI before addition to the assay. 
After mixing effectors and targets (in triplicate wells) and 

35 a brief pelleting, plates were placed at 37° C for 6 hours. 
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Approximately 90% of the supernatants were then collected 
from each well with the Skatron Harvesting Press and 
Supernatant Collection System (Skatron Instruments, Norway) . 
51 Cr release was detected using a WALLAC 1470 Wizard 
automatic gamma counter (WALLAC Inc., Gaithersburg MD) . 
Specific release was determined with the following equation: 
(experimental cpm - spontaneous cpm) / (total cpm- 
spontaneous cpm) X 100. Spontaneous release from the 
targets was less than 18%, and the standard error of the 
triplicate experimental counts was less than 14%. 

Statistical analysis 

Data for the effects of mIFN-a gene therapy on tumor 
growth were analyzed by repeated measures analysis. 
Individual treatment means were compared using Duncan's 
multiple range test when the main effect was significant. 
Data for the effect of mlFN-ct gene therapy on tumor 
rejection were analyzed by AN OVA . In all cases a p value of 
less than 0.05 was considered to be statistically 
significant . 

Results 

Expression of mIFN-a 

Murine IFN-a expression plasmid (pIF0836) was 
transfected into Cos-1 cells, and the resulting conditioned 
media was assayed for mIFN-a by Western blot and by 
bioassay. Western blot analysis of conditioned media 

indicated that the recombinant mIFN-a expressed from pIF0836 
template was present as a single band with an approximate 
molecular weight of 23 kDa . This band was not observed in 
conditioned media from mock-transf ected cells and likely 
represents a glycosylated form of mIFN-a. Recombinant mlFN- 
a ran with an approximate molecular weight of 18 kDa, which 
corresponds to the predicted molecular weight of non- 
glycosylated mIFN-a. Assay of conditioned media using an 
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anti-viral bioassay for mIFN-a indicated that approximately 
175 x 10 3 IU/ml mIFN-a were present. 

Anti-tumor activity of I FN -a gene therapy. The anti- 
tumor effect of murine IFNa4 plasmid formulated as a complex 
with PVP (IFNa/PVP) was tested in a syngeneic murine renal 
cell carcinoma (Renca) and a -mammary adenocarcinoma (TS/A) 
tumor model. BALB/c mice were challenged subcutaneously 
with 7 X10 5 Renca or 1 X10 5 CT26 cells, and IFNa/PVP 
injections were initiated 7 days later when tumors reached 
approximately 10 mm 3 size. Each group of mice received at 
interval of 1-2 days 8 treatments (4 injections/week) of 
IFNa/PVP at scalar doses (from 12 to 96 jig/mouse) , EP/PVP 

(96 ng/mouse) or no treatments for control (Ctrl) . Tumor 
size increased progressively in mice injected with EP/PVP 

(Renca, TS/A) or low doses of IFNa/PVP (TS/A), while tumors 
in mice injected with each dose of IFNa/PVP (Renca) or high 
dose of IFNa/PVP (TS/A) showed marked growth inhibition. 

Tumor growth inhibition is associated to systemic immune 
response 

Treatments of Renca and TS/A tumors with IFNa/PVP at 96 
fig/mouse and 4 8 ug/mouse respectively, induced complete 
regression in 6 out of 20 (Renca) and 2 out of 20 (TS/A) of 
challenged mice. To test whether the rejection of these 
tumors leads to specific immune memory, mice with no tumors 
for 40-50 days following IFNa treatments were re-challenged 
with a greater number of fresh tumors in the right flank. 
All mice that rejected primary tumors displayed protection 
against the second tumor challenge whereas mice used as the 
control group and injected for the first time with same 
number of tumor cells (1 X 10 € Renca or 2 X 10 5 TS/A) 
developed tumors. 

To evaluate the requirements for the induction of anti- 
tumor immune memory, Renca and TS/A were injected into 
BALB/c rendered immunosuppressed by treatment with anti-CD4, 
anti-CD8 or anti-polymorphonuclear cells (PMN) . Depletion 
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of CD8 + T cells allowed both Renca and TS/A to grow in all 
animals following IFNa/PVP treatments, showing that this 
population is crucial for the immune response induced by 
IFNct gene therapy. No increase in tumor growth was found in 
mice treated with anti-PMN (a-GRl) monoclonal Ab (mAb) . 
Increase in tumor rejection was observed in mice depleted of 
CD4 + T^and treated with IFNa/PVP suggesting that depletion 
of CD4 + T cells can enhance the anti-tumor effect of IFNa 
gene therapy. 

Expression of I FN -a within the tumor induces a CTL 
response. To assess whether CD8 + tumor specific CTL were 
induced in vivo by IFNa/PVP treatments, splenocytes from 
Renca tumor-challenged mice were tested for their cytolytic 
activity following IFNa gene therapy. Cytotoxic activity 
against Renca, and not against CT26 cells used as control 
for tumor specificity, was found in 2 of 4 mice that had 
received IFNa gene therapy. Moreover, splenocytes from mice 
depleted of CD4 + T cells and treated with IFNa/PVP 
demonstrated potent CTL activity against Renca cells. By 
contrast, little CTL activity against Renca cells was 
evident from splenocytes isolated from mice treated with 
EP/PVP. 

Discussion 

The data reported herein demonstrate that direct 
administration of IFNa gene formulated in a polymeric 
delivery system into subcutaneous renal cell carcinoma and 
mammary adenocarcinoma inhibits tumor growth and induces a 
long-lasting immunity to secondary tumor challenges. Of 
considerable significance is the fact that the anti-tumor 
response was observed against both an immunogenic carcinoma 
as well a more aggressive and poorly immunogenic 
adenocarcinoma, a phenotype which is similar to many 
spontaneously arising tumors in man. 

A variety of genetic abnormalities arise in human 
cancers that contribute to neoplastic transformation and 
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malignancy. Despite increasing understanding of the 

molecular basis of cancer, many malignancies remain 
resistant to established forms of treatment. More recently, 
molecular genetic interventions have been designed in an 
attempt to improve the efficacy of immunotherapy. While 
numerous experimental studies have been performed in murine 
models with tumor cells transduced with cytokine-gene ex 
vivo, a major limitation in the translation of this strategy 
to large-scale human tumor vaccine therapy is the labor 
intensity and variability of establishing each individual 
tumor in culture and transducing it witji an appropriate 
vector (i.e., retrovirus). Our approach to this problem is 
to use a non-viral delivery system to modify tumor cells in 
vivo with cytokine cDNAs so that the tumor cells can supply 
the cytokine of interest in a paracrine fashion to the anti- 
tumor responder cells present within the tumor. 

Using a plasmid containing IFNa4 gene and formulated in 
PVP, we have shown that intra- tumoral injections of this 
DNA-PINC complex can lead to complete tumor regression in 30 
% of the cases (Renca model) with an overall response rate 
of 100 % tumor growth , inhibition . These results are in 
agreement with a recent study that described an anti-tumor 
activity elicited by genetically modified TS/A cells 
producing murine IFN-otl (Ferrantini et al., 1994). Although 
the anti-tumor effect of IFNa using cytokine-gene transduced 
tumor cells has been described (Scarpa et al., 
"Extracellular Matrix Remodelling in a Murine Mamary 
Adenocarcicoma Transfected with the Interf eron-alphal Gene", 
Journal of Pathology. 181, 116-123 1997), the real value of 
IFNa gene therapy in blocking or inhibiting advanced tumors 
remains to be explored. The advantage of using a non-viral 
IFNa gene delivery system over retrovirus is that tumor 
cells could be transduced directly in vivo without the need 
of first establishing tumor cells in vitro. Moreover, IFNa 
has a potent anti-viral activity limiting the use of this 
gene in combination with viral vectors. 
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Therapeutic levels of gene expression for IGF-I using a 
similar interactive PVP-based delivery system have previusly 
been described (Alila et al., 1997). Direct intra-tumor 
injection of the same PINC delivery system as a complex with 
5 IFNct gene, resulted in dispersion in vivo of plasmid into 
target cells inducing an IFNct-specif ic anti-tumor activity. 
Tumors treated with the same plasmid but in the absence of 
IFNa coding region and formulated as a complex with PVP, did 
not respond to this treatment and grew in a similar rate to 

10 untreated tumors. By using an optimal dose of IFNa/PVP, 
tumor-bearing mice were able to reject the -tumors mounting a 
specific long-lasting tumor immunity. Although, the numbers 
of mice rejecting a second tumor challenge was low, this 
observation indicates that a considerable portion of the 

15 activity of IFNa in inducing the rejection of established 
tumors is not anti-angiogenic or anti-prolif erative but 
immunostimulatory. Our result demonstrating that IFNa- 
induced regression of advanced tumors was prevented by in 
vivo administration of anti-CD8 mAb provides direct evidence 

20 for a key role of CD8 + T cells in the anti-tumor effect of 
IFNa. 

Depletion of CD4 + T cells in tumor-bearing mice treated 
with IFNa gene therapy significantly enhanced the 
therapeutic effect of IFNa, resulting in tumor regression 

25 and prolonged survival of up to 80% of treated mice. A CD4- 
mediated suppression during tumor progression has been 
previously reported and it has also been shown that 
depletion of CD4 + T cells in tumor-bearing mice results in 
augmentation of anti-tumor therapy with either IL-2 or IL-12 

30 (Rackmilevich et al., 1994 and Martinotti et al., "Cd4 T 
Cells Inhibit in Vivo the Cd8-Mediated Immune Response 
Against Murine Colon Carcinoma Cells Transducted with 
Interleukin-12 Genes", Eur. J. Immunol , 25, 137-146. 
(1995)). They have shown that depletion of CD4 + T cells in 

35 tumor-bearing mice in the absence of treatment did not 
affect the growth of tumor itself suggesting that removal of 
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CD4 + T cells does not deprive the tumor of growth factors 
(Rackmilevich et al., 1994). A possible explanation for 
this phenomenon is that depletion of CD4 + T cells from tumor- 
bearing mice augments the anti-tumor efficacy of IFNa- 
5 activated CD8 + T cells by releasing them from 
immunosuppression. The mechanism driving CD4 suppression is 
poorly understood, although Th2 type cytokines, directly or 
through B cell activation, may inhibit cell-mediated 
immunity (Mossman et al . , 1989; Powrie et al., Eur- J- 

10 Immunol , 23 ( 11 ): 3043-9 (1993)). CTL can be generated in 
both CD4 -depleted and non-depleted mice from lymphocytes 
obtained from spleens by in vitro re-stimulation with 
mitomycin-treated Renca cells and rIL-2. Thus, CD4-mediated 
suppression appears to be exerted on CD8 expansion and not 

15 priming. In accord with the in vivo results, stronger CTL 
activity was observed from mice depleted of CD4 and treated 
with IFNcc/PVP suggesting CD4 + T cells inhibit an IFNa- 
mediated CD8 + T cell response in vivo. This study suggests 
that direct administration of cytokine genes, like IFNa, 

20 into tumors, which have been found to suppress malignancy 
growth, provide a new therapeutic option for the treatment 
of human cancers . 

Example 2: Pharmacology of ml FN - Gene Medicine in Syngeneic 
Tumor Models 

25 Gene expression systems encoding either mIFN-2 or mlFN- 

4 were tested for anti tumor activity when formulated in 
either cationic lipid, peptide, or PINC delivery systems and 
injected intratumorally into subcutaneous squamous cell 
carcinoma (SCC-VII) or adenocarcinoma (MC-38) tumors. 

30 Experimental design and treatment regimen 

Experiments to test the anti tumor activity of mlFN- 

gene medicine were conducted in either SCC-VII or MC-38 

tumor models. Tumor cells (4 x 10 5 ) were injected 

subcutaneously into the flank region of mice, and treatment 
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was initiated when tumor volume reached approximately 50 mm 3 . 
Treatment was begun approximately 6 days (SCC-VII tumors) 
and 10 days (MC-38 tumors) after tumor initiation and 
repeated at 3 to 5 day intervals. 
5 All formulations of ml FN- gene medicine were 

administered in a dose volume of 50 pi. The faster growing 
SCC-VII tumors typically received 3 treatments, whereas the 
relatively slower growing MC-38 tumors typically received A 
treatments. Experiments were terminated when lactose 

10 vehicle control tumors reached approximately 1000 mm 3 . 

The anti-tumor effects of murine I FN gene medicine 
(IFNa/PVP) was tested in syngeneic murine renal cell 
carcinoma (Renca) and mammary adenocarcinoma (TS/A) tumor 
model. BALB/c mice were challenged subcutaneously with 7 

15 X10 5 or 1 X10 5 CT26, and IFNa/PVP injections were initiated 7 
days later when tumors reached approximately 10 mm 3 size. 
Each group of mice received 8 treatments (4 injections for 2 
weeks) of IFNa/PVP at scalar doses (from 12 to 96 ug/mouse) , 
empty plasmid/PVP (EP/PVP, 96 ng/mouse) or no treatments for 

20 control (ctrl) . Tumor size increased progressively in mice 
injected with EP/PVP {Renca, TS/A) or low doses of IFNa/PVP 
(TS/A) , while tumors in mice injected with each dose of 
IFNa/PVP (Renca) or high dose of IFNa/PVP (TS/A) showed 
marked growth inhibition. 

25 Example 3: ml FN- Gene Medicine Formulated in Cationic lipid 
Reduces the Growth of SCC-VII Tumors 

Experiments were conducted in the SCC-VII tumor model 
as described in the preceding example. mIFN- gene medicine 
formulated in cationic lipid, peptide, and PINC delivery 

30 systems was tested. Results show that cationic lipid 
formulations significantly reduce the growth of SCC-VII 
tumors relative both to lactose vehicle injected tumors and 
to tumors injected with control (non coding) plasmid 
formulated in cationic lipid. The effect of ml FN- gene 

35 medicine formulated in cationic lipid is dose dependent and 
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there is no effect of mIFN- gene medicine when formulated 
in PVA. In addition, analysis of tumors from this 

experiment using immunohistochemical methods revealed 
infiltration of CD8 + lymphocytes in tumors injected with 
5 cationic lipid formulations, but not in tumors injected with 
PVA formulations. 

ml FN- gene medicine significantly reduces the growth of 
SCC-VII tumors as compared to control plasmid or lactose 
injected tumors. Differences between control plasmid and 

10 ml FN- plasmid were consistent across formulation. Plasmid 
dose was 4 6 ug/treatment . Growth of tumours injected with 
control plasmid was compared to that of tumors injected with 
mIFN- gene medicine using repeated measures analysis, 
ml FN- reduced SCC-VII tumor growth relative to control 

15 plasmid (p= . 035 ) . 

Example 4: ml FN- Gene Medicine Reduces the Growth of MC-38 
Tumors 

Experiments were carried out as described in the 
preceding examples. The effects of various prototype 

20 formulations of mIFN- gene medicine on . the growth of 
subcutaneous MC-38 tumors were compared. mIFN- gene 

medicine elicited reduced tumor growth in all formulations 
tested (cationic lipid, peptide, and PINC) . Subsequent 
experiments in the MC-38 tumor model have shown similar 

25 results. 

Example 5: Dose Responses 

After demonstrating anti tumor effects of ml FN- gene 
medicine, the dose response for these effects was 
investigated in the MC-38 tumor model. Both cationic lipid 
30 (D0TMA:Chol) and PINC (PVA) delivery systems were evaluated. 
Results clearly show that mIFN- gene medicine elicited a 
dose dependent reduction in tumor growth. Tumor volume in 
this experiment was maximally reduced by approximately 50 % 
with ml FN- /DOTMA:Chol and 60 % with ml FN- /PVA after 4 
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treatments. Maximal reduction in tumor volume was observed 
at a plasmid dose of approximately 50 pg/treatment 
(cumulative dose of approximately 200 ug) . These 
experiments will be conducted primarily in the MC-38 tumor 
model because it provides a broader treatment window than 
does the SCC-VII model. 

Example 6: IFN-alpha Formulations 

The formulations for the IFN-a are: (1) PVP 4 vial, (2) 

PVP three vial, (3) PVP two vial. The details are listed 
below: 

PVP 4 vial 

Materials: 25% PVP (50 kDa) stock solution, plasmid 
stock solution, 5 M NaCl stock solution, and water. 

Method: Add in order of water, plasmid, 25% PVP and 5 M 
NaCl into a vial to make a plasmid in 5% PVP in saline 
formulation. The final concentration of PVP and NaCl are 
fixed (5% and 150 mM) and plasmid concentration could be 
varied (but based on the IGF-1 data, 3 mg DNA/ml in 5% PVP 
in saline should be the best formulation) . The quality of 
the formulation is characterized by pH, DNA concentration, 
osmolality, and gel electrophoresis. The DNA concentration 
could be varied from 0.1-5 mg/ml. The pH may be varied from 
3-5, osmolality may be 250 - 400 mOsm. 

Three vial 

Material: lyophilized PVP, plasmid stock solution (4 
mg/ml), 115 mM Na-Cit rate/5% NaCl stock buffer (pH = 4). 

Method: Add in order of plasmid and buffer into the PVP 
to make final 3 mg DNA/ml in 5% PVP in 25 mM citrate/saline 
buffer (pH =4) . DNA expression is higher in saline than in 
the citrate buffer. 
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Two vial 

Materials: Co-lyophilized plasmid and PVP, saline. Add 
saline into the co-lyophilized DNA and PVP to make final 3 
mg/ml DNA in 5% PVP in saline. 

The final formulation is 3mg/mL DNA, 5% PVP as a single 
vial. The formulation is prepared by adding (5%) PVP to 
(4mg/mL) DNA and recirculating the two components for a 
finite period of time (using static mixer) . Then the 
formulation is lyophilized and rehydrated with 0.9% sodium 
chloride, to obtain a final composition of 3mg/mL, 5%PVP in 
saline. 

Example 7 : Treatment of Human Tumors 

The murine studies are predictive of the response of 
Human tumors to therapy using a plasmid construct encoding 
the human I FN alpha gene sequence such as that depicted in 
SEQ ID NO: 10, 11 or 12. A patient in need of anti-cancer 
therapy is injected with up to 3mg of plasmid formulation at 
daily intervals. The plasmid formulation may contain INF 
alpha plasmid alone or optionally a mixture of IFN-alpha 
encoding plasmid and an additional plasmid coding for a 
cytokine. The preffered cytokine is IL-12. The treatments 
are continued and the patient monitored as is the usual 
practice for anti-cancer chemotherapeutic regimes. 

One skilled in the art would readily appreciate that 
the present invention is well adapted to carry out the 
objects and obtain the ends and advantages mentioned, as 
well as those inherent therein. The molecular complexes and 
the methods, procedures, treatments, molecules, specific 
compounds described herein are presently representative of 
preferred embodiments are exemplary and are not intended as 
limitations on the scope of the invention. Changes therein 
and other uses will occur to those skilled in the art which 
are encompassed within the spirit of the invention are 
defined by the scope of the claims. 
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It will be readily apparent to one skilled in the art 
that varying substitutions and modifications may be made to 
the invention disclosed herein without departing from the 
scope and spirit of the invention. 
5 All patents and publications mentioned in the speci- 

fication are indicative of the levels of those skilled in 
the art to which the invention pertains. All patents and 
publications are herein incorporated by reference to the 
same extent as if each individual publication was specific- 
10 ally and individually indicated to be incorporated by 
reference. 

The invention illustratively described herein suitably 
may be practiced in the absence of any element or elements, 
limitation or limitations which is not specifically 
15 disclosed herein. Thus, for example, in each instance 
herein any of the terms "comprising", "consisting 
essentially of" and "consisting of" may be replaced with 
either of the other two terms. The terms and expressions 
which have been employed are used as terms of description 
and not of limitation, and there is no intention that in the 
use of such terms and expressions of excluding any 
equivalents of the features shown and described or portions 
thereof, but it is recognized that various modifications are 
possible within the scope of the invention claimed. Thus, 
25 it should be understood that although the present invention 
has been specifically disclosed by preferred embodiments and 
optional features, modification and variation of the 
concepts herein disclosed may be resorted to by those 
skilled in the art, and that such modifications and 
30 variations are considered to be within the scope of this 
invention as defined by the appended claims. 

In addition, where features or aspects of the invention 
are described in terms of Markush groups, those skilled in 
the art will recognize that the invention is also thereby 
35 described in terms of any individual member or subgroup of 
members of the Markush group. For example, if X is 


20 
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described as selected from the group consisting of bromine, 
chlorine, and iodine, claims for X being bromine and claims 
for X being bromine and chlorine are fully described. 

Other embodiments are within the following claims. 
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Claims 

1 • A plasmid comprising a CMV promoter 
transcriptionally linked with an interferon alpha coding 
sequence, and a growth hormone 3 1 -untranslated region. 

2. The plasmid of claim 1, wherein said interferon 
alpha is human interferon alpha. 

3. The plasmid of claim 2, wherein said human 
interferon alpha coding sequence is a synthetic sequence 
having optimal codon usage. 

4. The plasmid of claim 3, wherein said interferon 
alpha coding sequence has the nucleotide sequence of SEQ ID 
NO:10, 11 or 12. 

5. The plasmid of claim 1, wherein said growth 
hormone 3* untranslated region is from a human growth 
hormone gene. 

6. The plasmid of claim 5, wherein an ALU repeat or 
ALU repeat-like sequence is deleted from said 3' 
untranslated region. 

7. The plasmid of claim 1, wherein said plasmid 
includes a promoter, a TATA box, a Cap site and a first 
intron and intron/exon boundary in appropriate relationship 
for expression of said coding sequence. 

8. The plasmid of claim 7, wherein said plasmid 
further comprises a 5* mRNA leader sequence inserted between 
said promoter and said coding sequence. 
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9. The plasmid of claim 1, wherein said plasmid 
further comprises an intron/5' UTR from a chicken skeletal 
a-actin gene. 

10. The plasmid of claim 1, wherein said plasmid 
5 comprises a nucleotide sequence which is the same as the 

nucleotide sequence of plasmid pIF0921. 

11. The plasmid of claim 1, further comprising: 

a first transcription unit comprising a first 
transcriptional control sequence transcriptionally linked 

10 with a first 5 1 -untranslated region, a first intron, a first 
coding sequence, and a first 3 ' -untranslated region/poly (A) 
signal, wherein said first intron is between said control 
sequence and said first coding sequence; and 

a second transcription unit comprising a second 

15 transcriptional control sequence transcriptionally linked 
with a second 5 1 -untranslated region, a second intron, a 
second coding sequence, and a second 3 ' -untranslated 
region/poly (A) signal, wherein said second intron is between 
said control sequence and said second coding sequence; 

20 wherein said first and second coding sequences 

comprise a sequence having the sequence of SEQ ID NO: 2, 3, 4 
or 25 coding for human IL-12 p40 subunit, and a sequence 
having the sequence of SEQ ID NO: 6, 7, 8 or 24 coding for 
human IL-12 p35 subunit. 

25 12. The plasmid of claim 11, wherein said first 

transcriptional control sequence or said second 
transcriptional control sequence comprise one or more 
cytomegalovirus promoter sequences. 

13. The plasmid of claim 11, wherein said first and 
30 second transcriptional control sequences are the same. 
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14. The plasmid of claim 11, wherein said first and 
second transcriptional control sequences are different. 

15. The plasmid of claim 14, wherein said sequence 
coding for the p40 subunit of human IL-12 is 5' to said 

5 sequence coding for the p35 subunit of human IL-12. 

16. The plasmid of claim 1, further comprising an 
intron having variable splicing, a first coding sequence, 
and a second coding sequence, 

wherein said first and second -coding sequences 
10 comprise a sequence having the sequence of SEQ ID NO: 2, 3, 4 
or 25 coding for human IL-12 p40 subunit, and a sequence 
having the sequence of SEQ ID NO: 6, 7, 8 or 24 coding for 
human IL-12 p35 subunit. 

17. The plasmid of claim 16, further comprising: 

15 a transcriptional control sequence 

transcriptionally linked with a first coding sequence and a 
second coding sequence; 

a 5 1 -untranslated region; 

an intron 5' to said first coding sequence; 
20 an alternative splice site 3' to said first coding 

sequence and 5* to said second coding sequence; and 
a 3 1 -untranslated region/poly (A) signal. 

18. The plasmid of claim 17, wherein said 
transcriptional control sequence comprises a cytomegalovirus 

25 promoter sequence. 

19. The plasmid of claim 1, further comprising: 

a transcriptional control sequence 

transcriptionally linked with a first coding sequence, an 
IRES sequence, a second coding sequence, and a 3'- 
30 untranslated region/poly (A) signal, wherein said IRES 
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sequence is between said first coding sequence and said 
second coding sequence; and 

an intron between said promoter and said first 
coding sequence; 

5 wherein said first and second coding sequences 

comprise a sequence having the sequence of SEQ ID NO: 2, 3, 4 
or 25 coding for human IL-12 p40 subunit, and a sequence 
having the sequence of SEQ ID NO: 6, 7, 8 or 24 coding for 
human IL-12 p35 subunit. 

10 20. The plasmid of claim 19,.- wherein said 

transcriptional control sequence comprises a cytomegalovirus 
promoter sequence . 

21. The plasmid of claim 19, wherein said IRES 
sequence is from an encephalomyocardit is virus. 

15 22. A composition comprising the plasmid of anyone of 

claims 1-21, and a protective, interactive non-condensing 
compound. 

23. The composition of claim 22 , wherein said 
protective, interactive non-condensing compound is polyvinyl 

20 pyrrolidone. 

24. The composition of claim 22, wherein said plasmid 
is in a solution having between 0.5% and 50% PVP. 

25. The composition of claim 24, wherein said solution 
includes about 5% PVP. 

25 26. The composition of claim 22, wherein said DNA is 

at least about 80% supercoiled. 

27. The composition of claim 26, wherein said DNA is 
at least about 90% supercoiled. 
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28. The composition of claim 27, wherein said DNA is 
at least about 95% supercoiled. 

29. A composition comprising a protective, interactive 
non-condensing compound and a plasmid comprising an 
interferon alpha coding sequence. 

30. A composition comprising the plasmid of any one of 
claims 1-21 and a cationic lipid with a neutral co- 
lipid. 

31. The composition of claim 30, wherein said cationic 
lipid is DOTMA. 

32. The composition of claim 30, wherein said neutral 
co-lipid is cholesterol. 

33. The composition of claim 30, wherein the DNA in 
said plasmid and said cationic lipid are present in such 
amounts that the negative to positive charge ratio is about 
1:3. 

34. The composition of claim 30, wherein said DNA is 
at least about 80% supercoiled. 

35. The composition of claim 34, wherein said DNA is 
at least about 90% supercoiled. 

36. The composition of claim 35, wherein said DNA is 
at least about 95% supercoiled. 


37. The composition of claim 30, further comprising an 
isotonic carbohydrate solution. 
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38. The composition of claim 37, wherein said isotonic 
carbohydrate solution consists essentially of about 10% 
lactose . 

39 The composition of claim 30 wherein said cationic 
lipid and said neutral co-lipid are prepared as a liposome 
having an extrusion size of about 800 nanometers. 

40. A composition comprising: 

a first component comprising a plasmid comprising 
an interferon alpha coding sequence and a cationic lipid 
) with a neutral co-lipid, wherein said cationic lipid is 
DOTMA and said neutral co-lipid is cholesterol, wherein the 
DNA in said plasmid and said cationic lipid are present in 
amounts such that the negative to positive charge ratio is 
about 1:3; and 

5 a second component comprising a protective, 

interactive non-condensing compound, wherein said first 
component is present within the second component. 

41 A composition comprising a protective, interactive 
non-condensing compound, a first plasmid comprising an 
0 interferon alpha coding sequence, and one or more other 
plasmids independently comprising an IL-12 or IL-12 subumt 
coding sequence. 

42. A composition comprising a plasmid comprising an 
interferon alpha coding sequence and a cationic lipid with a 
>5 neutral co-lipid. 

4 3 A method for making a plasmid of anyone of claims 
1-21 comprising the step of inserting a CMV promoter 
transcriptionally linked with an interferon alpha coding 
sequence, and a growth hormone 3 ■ -untranslated region into a 
30 plasmid. 
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44. A method for making a composition of claim 29 
comprising the steps of: 

a. preparing a DNA molecule comprising a 
transcriptional unit, wherein said transcriptional unit 

5 comprises an interferon alpha coding sequence; 

b. preparing a protective, interactive non- 
condensing compound; and 

c combining said protective, interactive non- 
condensing compound with said DNA in conditions such that a 
10 composition capable of delivering a therapeutically 

—IT" r 011 "' ° f ^ interfer °" ^ sequence to I 

mammal is formed. 

45. The method of claim 4 A wherein said DNA molecule 

15 tL\ 7"' Wherein Sa±d PlaSmid COI "P rises * CMV promoter 

15 transcriptionally linked with an interferon alp ha coding 

sequence, and a human growth hormone 3 • -untranslatel 
region/poly(A) signal. 6 ° 


46. A method of making a composition of claim 30 
comprising the steps of: 

alnh. Preparing 3 DNA uprising an interferon 

aipna coding sequence; 

b. preparing a mixture of a cationic lipid and a 
neutral co-l ip id, wherein said cationic lipid is DOTMA and 
said neutral co-lipid is cholesterol; and 

c. combining said mixture with said DNA in 
amounts such that said cationic lipid and said DNA are 
present in a negative to positive charge ratio of about 1:3. 

47. A method of making a composition of claim 40 
comprising the steps of: ' 

plasmid 3 ' • Pr6Paring 3 first «>*>P°nent comprising a 
Plasmid comprising an interferon alpha coding sequence and a 
cationic Hp ld with , neutrgl co _ lipid , where . n said 
cationic Hp id is DOTMA and said neutral co-lipid 
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cholesterol, wherein the DNA in said plasmid and said 
cationic lipid are present in amounts such that the negative 
to positive charge ratio is about 1:3; 

b. preparing a second component comprising a 
5 protective, interactive non-condensing compound; and 

c. combining said first and second components 
such that the resulting composition comprises said first 
component within said second component. 

48. A method of making a composition of claim 41, 
10 comprising the steps of: 

a. preparing a protective, interactive non- 
condensing compound, 

b. preparing a first plasmid comprising an 
interferon alpha coding sequence, 

15 c. preparing one or more other plasmids 

independently comprising an IL-12 p35 or IL-12 p40 subunit 
coding sequence , and 

d. combining said protective, interactive non- 
condensing compound, said plasmid comprising said interferon 

20 alpha coding sequence and said other plasmids. 

4 9. A method of making a composition of claim 42 
comprising combing a plasmid comprising a interf eronalpha 
coding sequence and a cationic lipid with a neutral co- 
lipid. 

25 50. A method for treatment of a mammalian condition or 

disease, comprising administering to a mammal suffering from 
said condition or disease a therapeutically effective amount 
of a plasmid of anyone of claims 1-21. 

51. The method of claim 50, wherein said condition or 
30 disease is a cancer. 
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52. The method of claim 50, wherein said composition 
is administered by injection. 

53. A method for transfection of a cell in situ, 
comprising the step of contacting said cell with a plasmid 
of anyone of claims 1-21 for sufficient time to transfect 
said cell. 

54. The method of claim 53, wherein transfection of 
said cell is performed in vivo. 

55. The method of claim 53, wherein said contacting is 
performed in the presence of an about 5% PVP solution. 

56. A method for delivery and expression of an 
interferon alpha gene in a plurality of cells, comprising 
the steps of: 

(a) transfecting said plurality of cells with a 
plasmid of anyone of claims 1-21; and 

(b) incubating said plurality of cells under 
conditions allowing expression of a nucleic, acid sequence in 
said vector, wherein said nucleic acid sequence encodes 
interferon alpha. 

57. The method of claim 56, wherein said interferon 
alpha is human interferon alpha and said cells are human 
cells . 

58. The method of claim 56, wherein said contacting is 
performed in the presence of an about 5% PVP solution. 

59. A method for treating a disease or condition, 
comprising the steps of transfecting a cell in situ with a 
plasmid of any one of claims 1-21. 
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60. The method of claim 59, wherein said disease or 
condition is a localized disease or condition. 

61. The method of claim 59, wherein said disease of 
condition is a systemic disease or condition. 

5 62 . A cell transfected with a plasmid of anyone of 

claims 1-21. 

63. A method for treatment of a mammalian condition or 
disease, comprising administering to a mammal suffering from 
said condition or disease a therapeutically effective amount 

10 of a composition of claim 22. 

64. A method for treatment of a mammalian condition or 
disease, comprising administering to a mammal suffering from 
said condition or disease a therapeutically effective amount 
of a composition of claim 29. 

15 65. A method for treatment of a mammalian condition or 

disease, comprising administering to a mammal suffering from 
said condition or disease a therapeutically effective amount 
of a composition of claim 30. 

66. A method for treatment of a mammalian condition or 
20 disease, comprising administering to a mammal suffering from 

said condition or disease a therapeutically effective amount 
of a composition of claim 40. 

67 . A method for treatment of a mammalian condition or 
disease, comprising administering to a mammal suffering from 

25 said condition or disease a therapeutically effective amount 
of a composition of claim 41. 

68. A method for treatment of a mammalian condition or 
disease, comprising administering to a mammal suffering from 
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said condition or disease a therapeutically effective amount 
of a composition of claim 42. 

69. A method for treatment of a mammalian condition or 
disease, comprising administering to a mammal suffering from 
5 said condition or disease a therapeutically effective amount 
of a composition of a first plasmid comprising an interferon 
alpha coding sequence and a second plasmid comprising a IL- 
12 coding sequence. 
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<110> NORDSTROM, JEFF; PERICLE, FEDERICA; ROLLAND, 

ALLAIN; RALSTON, ROBERT 

<120> INTERFERON ALPHA PLASMIDS AND DELIVERY SYSTEMS, 

5 AND METHODS OF MAKING AND USING THE SAME 

<150> US 08/949,160 and PCT/US97 /1877 9 

<151> October 10, 1997 

<160> 25 

<210> 1 

10 <211> 328 

<212> amino acid 

<400> 1 

Met Cys His Gin Gin Leu Val He Ser Trp Phe Ser Leu Val Phe Leu 

15 1 5 10 15 

Ala Ser Pro Leu Val Ala He Trp Glu Leu Lys Lys Asp Val Tyr Val 
20 25 30 

20 Val Glu Leu Asp Trp Tyr Pro Asp Ala Pro Gly Glu Met Val Val Leu 
35 40 45 

Thr Cys Asp Thr Pro Glu Glu Asp Gly He Thr Trp Thr Leu Asp Gin 
50 55 60 . 

25 

Ser Ser Glu Val Leu Gly Ser Gly Lys Thr Leu Thr He Gin Val Lys 
65 70 75 80 

Glu Phe Gly Asp Ala Gly Gin Tyr Thr Cys His Lys Gly Gly Glu Val 
30 85 90 95 

Leu Ser His Ser Leu Leu Leu Leu His Lys Lys Glu Asp Gly He Trp 
100 105 110 

35 Ser Thr Asp He Leu Lys Asp Gin Lys Glu Pro Lys Asn Lys Thr Phe 
115 120 125 

Leu Arg Cys Glu Ala Lys Asn Tyr Ser Gly Arg Phe Thr Cys Trp Trp 
130 135 140 

40 

Leu Thr Thr He Ser Thr Asp Leu Thr Phe Ser Val Lys Ser Ser Arg 
145 150 155 160 

Gly Ser Ser Asp Pro Gin Gly Val Thr Cys Gly Ala Ala Thr Leu Ser 
45 165 170 175 

Ala Glu Arg Val Arg Gly Asp Asn Lys Glu Tyr Glu Tyr Ser Val Glu 
180 185 190 
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Cys Gin Glu Asp Ser Ala Cys Pro Ala Ala Glu Glu Ser Leu Pro He 
195 200 205 

5 Glu Val Met Val Asp Ala Val His Lys Leu Lys Tyr Glu Asn Tyr Thr 

215 220 

Ser Ser Phe Phe He Arg Asp He He Lys Pro Asp Pro Pro Lys Asn 

10 235 240 

Leu Gin Leu Lys Pro Leu Lys Asn Ser Arg Gin Val Glu Val Ser Tro 
245 250 255 

Glu Tyr Pro Asp Thr Trp Ser Thr Pro His Ser Tyr Phe Ser Leu Thr 
260 265 270 

Phe Cys Val Gin Val Gin Gly Lys Ser Lys Arg Glu -Lys Lys Asp Arg 

280 285 

20 Val Phe Thr Asp Lys Thr Ser Ala Thr Val He Cys Arg Lys Asn Ala 
yu 295 300 

Ser lie Ser Val Arg Ala Gin Asp Arg Tyr Tyr Ser Ser Ser Trp Ser 
25 315 320 

Glu Trp Ala Ser Val Pro Cys Ser 
325 

<210> 2 
30 <211> 987 

<212> nucleic acid 

<400> 2 


^ ^SZ^ TCACC AG CAGTTGGT CATCTCTTGG TTTTCCCTGG TTTTTCTGGC ATCTCCrPTr 

B*™™** GAAAGATGTT TATGTCGTAG AATTGGATTG J 

arSSS™ AAATGGTGGT CCTCACCTGT GACACCCCTG AAGAAGATGG TATCACCTGG lie 

ACCTTGGACC AGAGCAGTGA GGTCTTAGGC TCTGGCAAAA CCCTGACCAT CCAAGT^AA l* r 

ATGCTGGCCA GTACACCTGT CACAAAGGAG GcSg^CT ^CAT^ 300 

4 0 aISI^ C ttcacaaaaa ggaagatgga atttggtcca ctgatatttt aa^gg^cSg llo 

^^ Ch AAAATAAGAC CTTTCTAAGA TGCGAGGCCA AGAATTATTC TGGACGTTTC stn 

A ^S CTGGT GGCT GACGAC AATCAGTACT GATTTGACAT TCAGTGTCAA A^GCA^IrA lln 

GGCTCTTCTG ACCCCCAAGG GGTGACGTGC GGAGCTGCTA CACTCTC^ AGAGAGAGTC JSS 

cr™^ ACAAGGAGTA TGAGTACTCA GTGGAGTGCC AGGAGGACAG TGCCTGCCCA 600 

4 5 g£SSS^ AGAGTCTG CC CATTGAGGTC ATGGTGGATG CCGTTCACAA GCTC^g5a A 660 

3D GAAAACTACA CCAGCAGCTT CTTCATCAGG G AC AT CATC A AACCTGACCC arrraanl^ So2 

llc^r^ AGCCATTAAA GAATTCTCGG CAGGTG^GG T^AGCTGGGA S£S£££ Vso 

A?rInr A ?n A ^ TCCACATTC CTACTTCTCC CTGACATTCT GCGTTCAGGT CCAGGGCAAG 840 

rrrnt^S ^ AAGAAAGA TAGAGTCTTC ACGGACAAGA CCTCAGCCAC GGTCATCTGC lan 

50 GA^TGGGCAT SEE?** SSSS SSS 
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<210> 3 
<211> 987 
<212> nucleic acid 

<400> 3 

5 wrTrrCACC AGCAGCTGGT GATCAGCTGG TTCAGCCTGG TGTTCCTGGC CAGCCCCCTG 

g?gSc? gggagc?gaI g^aggacgtg TACGTGGTGG agctggactg gtaccccgac 

GCCCCCGGCG AGATGGTGGT GCTGACCTGC GACACCCCCG AGGAGGACGG CATCACCTGG 180 
ACCCTGGACC AGAGCAGCGA GGTGCTGGGC AGCGGCAAGA CCCTGACCAT CCAGGTGAAG 240 
1 n rAGTTCGGCG ACGCCGGCCA GTACACCTGC CACAAGGGCG GCGAGGTGCT GAGCCACAGC 300 

1 ctgSgctgc ?gcacaagS ggaggacggc ATCTGGAGCA ccgacatcct gaaggaccag 360 

aIgGAGCCCA AGAACAAGAC CTTCCTGCGC TGCGAGGCCA AGAACTACAG CGGCCGCTTC 420 

ACCTGCTGGT GGCTGACCAC CATCAGCACC GACCTGACCT TCAGCGTGAA GAGCAGCAGG 480 

rrrlrCAGCG ACCCCCAGGG CGTGACCTGC GGCGCCGCCA CCCTGAGCGC CG AG CGCGTG 54 0 

15 CGCGGCGACA ACAAGGAGTA CGAGTACAGC GTGGAGTGCC AGGAGGACAG CGCCTGCCCC 600 

GCCGCCGAGG AGAGCCTGCC CATCGAGGTG ATGGTGGACG CCGTCCACAA GCTGAAGTAC 660 

GAgSc^CA CCAGCAGCTT CTTCATCCGC GACATCATCA AGCCCGACCC CCCCAAGAAC 720 

CTGCAGCTGA AGCCCCTGAA GAACAGCCGC CAGGTGGAGG TGAGCTGGGA GTACCCCGAC 780 

ACCTGGAGCA CCCCCCACAG CTACTTCAGC CTGACCTTCT GCGTGCAGGT GCAGGGCAAG 840 

20 AGCAAGCGCG S£aGGA CCGCGTGTTC ACCGACAAGA CCAGCGCCAC CGTGATCTGC 900 
CGCAAGAACG CCAGCATCAG CGTGCGCGCC CAGGACCGCT ACTACAGCAG C AG CTGGAG C 
GAGTGGGCCA GCGTGCCCTG CAGCTAG 


60 
120 


900 
960 
987 


<210> 4 

25 <211> 987 

<212> nucleic acid 

<400> 4 


30 


35 


<210> 5 

<211> 219 

<212> amino acid 

50 <400> 5 


60 
120 


ATGTGCCACC AGCAGCTGGT GATCAGCTGG TTCTCCCTGG TGTTTCTGGC CAGCCCCCTC 
GTGGCCATCT GGGAGCTGAA GAAAGACGTG TACGTGGTCG AGCTGGACTG GTACCCCGAC 

GCCCCCGGCG AGATGGTGGT CGTGACCTGC GACACCCCCG AGGAAGACGG CATCACCTGG 180 

ACCCTGGACC AGAGCAGTGA GGTGCTGGGC TCCGGCAAGA CCCTGACCAT CCAGGTGAAG 240 

GAGTTCGGCG ACGCCGGCCA GTACACCTGC CACAAGGGAG GCGAGGTGCT GAGCCACTCC 300 

CTCCTGCTGC TCCACAAAAA GGAGGACGGC ATCTGGAGCA CCGACATCCT GAAGGACCAG 360 

aIg^AGCCCA AGAACAAGAC CTTCCTGCGC TGCGAGGCCA AGAACTACAG CGGCCGCTTC 20 

ACCTGCTGGT GGCTGACCAC GATCAGCACC GACCTGACCT TCAGTGTGAA GAGCAGCAGG 4 80 

GGCTCCAGCG ACCCCCAGGG CGTGACCTGC GGCGCTGCCA CCCTGAGCGC CGAGCGCGTG 540 

CGCGGCGACA ACAAGGAGTA CGAGTACAGC GTGGAGTGCC AGGAAGACTC CGCCTGCCCC 600 

GCCGCTGAGG AGAGCCTGCC CATCGAGGTG ATGGTGGACG CCGTTCACAA GCTGAAGTAC 660 

4 0 GAGAACTACA CCAGCAGCTT CTTCATCCGC GACATCATCA AGCCTGACCC ACCCAAGAAC 720 

40 CTCCAGCTGA SSI GAACTCCCGC CAGGTGGAGG TGAGCTGGGA GTACCCCGAC 780 

ATCTGGAGCA CGCCCCACTC CTACTTCTCC CTGACCTTCT GCGTGCAGGT CCAGGGCAAG 840 

AGCAAGCGCG AGA^GA CCGGGTGTTC ACCGACAAGA CCAGCGCCAC CGTCATCTGC 900 

CGCAAGAACG CCAGCATCAG CGTGCGCGCC CAGGACCGCT ACTATAGCTC CTCTTGGAGC 960 
4 5 GAGTGGGCCA GCGTGCCCTG CTCCTAG 


Met Cvs Pro Ala Arg Ser Leu Leu Leu Val Ala Thr Leu Val Leu Leu 
15 10 15 
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Asp His Leu Ser Leu Ala Arg Asn Leu Pro Val Ala Thr Pro Asp Pro 
20 25 30 

Gly Met Phe Pro Cys Leu His His Ser Gin Asn Leu Leu Arg Ala Val 
35 40 45 

Ser Asn Met Leu Gin Lys Ala Arg Gin Thr Leu Glu Phe Tyr Pro Cys 
50 55 60 

Thr Ser Glu Glu lie Asp His Glu Asp lie Thr Lys Asp Lys Thr Ser 
65 70 75 80 

Thr Val Glu Ala Cys Leu Pro Leu Glu Leu Thr Lys Asn Glu Ser Cys 
85 90 95 

Leu Asn Ser Arg Glu Thr Ser Phe lie Thr Asn Gly Ser Cys Leu Ala 
100 105 - HO 

Ser Arg Lys Thr Ser Phe Met Met Ala Leu Cys Leu Ser Ser lie Tyr 
115 120 125 


Glu Asp Leu Lys Met Tyr Gin Val Glu Phe Lys Thr Met Asn Ala Lys 
130 135 140 

Leu Leu Met Asp Pro Lys Arg Gin lie Phe Leu Asp Gin Asn Met Leu 
145 150 155 160 

Ala Val lie Asp Glu Leu Met Gin Ala Leu Asn Phe Asn Ser Glu Thr 
165 170 175 

Val Pro Gin Lys Ser Ser Leu Glu Glu Pro Asp Phe Tyr Lys Thr Lys 
180 185 190 

lie Lys Leu Cys lie Leu Leu His Ala Phe Arg lie Arg Ala Val Thr 
195 200 205 

lie Asp Arg Val Thr Ser Tyr Leu Asn Ala Ser 
210 215 

<210> .6 

<211> 660 

<212> nucleic acid 

<400> 6 


ATGTGTCCAG CGCGCAGCCT CCTCCTTGTG GCTACCCTGG TCCTCCTGGA CCACCTCACT €0 

TTGGCCAGAA ACCTCCCCGT GGCCACTCCA GACCCAGGAA TGTTCCCATG CCTTCACCAC 120 

TCCCAAAACC TGCTGAGGGC CGTCAGCAAC ATGCTCCAGA AGGCCAGACA AACTCTAGAA 180 

TTTTACCCTT GCACTTCTGA AGAGATTGAT CATGAAGATA TCACAAAAGA T AAAACC AG C 24 0 

ACAGTGGAGG CCTGTTTACC ATT GGAATT A ACCAAGAATG AGAGTTGCCT AAATT CC AG A 300 

GAGACCT CTT TCATAACTAA TGGGAGTTGC CTGGCCTCCA GAAAGACCTC TTTTATGATG 360 

GCCCTGTGCC TTAGTAGTAT TTATGAAGAC TTGAAGATGT ACCAGGTGGA GTTCAAGACC 4 20 

ATGAATGCAA AGCTTCTGAT GGATCCTAAG AGGCAGATCT TTCTAGATCA AAACATGCTG 4 80 

GCAGTTATTG ATGAGCTGAT GCAGGCCCTG AATTTCAACA GTGAGACTGT GCCACAAAAA 54 0 

TCCTCCCTTG AAGAACCGGA TTTTTATAAA ACTAAAATCA AGCTCTGCAT ACTTCTTCAT 600 

GCTTTCAGAA TTCGGGCAGT G ACT ATT G AC AGAGTGACGA GCTATCTGAA TGCTTCCTAA 660 
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<210> 7 

<211> 660 

<212> nucleic acid 

<400> 7 

5 

ATGTGCCCCG CCCGCAGCCT GCTGCTGGTG GCCACCCTGG TGCTGCTGGA CCACCTGAGC 60 

CTGGCCCGCA ACCTGCCCGT GGCCACCCCC GACCCCGGCA TGTTCCCCTG CCTGCACCAC 120 

AGCCAGAACC TGCTGGCGGC CGTGAGCAAC ATGCTGCAGA AGGCCGCGCA GACCCTGGAG 180 

TTCTACCCCT GCACCAGCGA GGAGATCGAC CACGAGGACA T C ACCAAGG A CAAGACCAGC 240 

10 ACCGTGGAGG CCTGCCTGCC CCTGGAGCTG ACCAAGAACG AGAGCTGCCT GAACAGCCGC 300 

GAGACCAGCT TCATCACCAA CGGCAGCTGC CTGGCCAGCC GCAAG AC C AG CTTCATGATG 360 

GCCCTGTGCC TGAGCAGCAT CTACGAGGAC CTGAAGATGT ACCAGGTGGA GTTCAAGACC 420 

ATGAACGCCA AGCTGCTGAT GGACCCCAAG CTCCAGATCT TCCTGGACCA GAACATGCTG 480 

GCCGTGATCG ACG AG CTG AT GCAGGCCCTG AACTTCAACA GCGAGACCGT GCCCCAGAAG 540 

15 AGCAGCCTGG AGGAGCCCGA CTTCTACAAG ACCAAGATCA AGCTGTGCAT CCTGCTGCAC 600 

GCCTTCCGCA TCCGCGCCGT GACCATCGAC CGCGTGACCA GCTACCTGAA CGCCACCTGA 660 

<210> 8 
<211> 660 
20 <212> nucleic acid 

<400> 8 

ATGTGCCCCG CCCGCAGCCT GCTGCTCGTG GCCACCCTGG TGCTCCTGGA CCACCTCAGC 60 

CTGGCCCGCA ACCTCCCCGT GGCCACCCCA GACCCCGGCA TGTTCCCATG CCTGCACCAC 120 

25 AGCCAGAACC TGCTGGCGGC CGTGAGCAAC ATGCTGCAGA AGGCCGCGCA GACCCTGGAG 180 

TTCTACCCCT GCACCAGCGA GGAGATCGAC CACGAGGACA TCACCAAGGA CAAGACCAGC 240 

ACCGTGGAGG CCTGCCTGCC CCTCGAGTTA ACCAAGAACG AGAGCTGCCT CAACAGCCGC 300 

GAGACCTCCT TCATCACCAA CGGCACTTGC CTGGCCTCCC GCAAGACCAG CTTCATGATG 360 

GCCCTGTGCC TGAGCTCCAT CTACGAGGAC CTGAAGATGT ACCAGGTGGA GTTCAAGACC 4 20 

30 ATGAACGCCA AGCTCCTGAT GGACCCCAAG CTCCAGATCT TCCTGGACCA GAACATGCTG 480 

GCCGTGATCG ACGAGCTGAT GCAGGCCCTG AACTTCAACA GCGAGACCGT GCCCCAGAAG 54 0 

AGCAGCCTGG AGGAGCCCGA CTTCTACAAG ACCAAGATCA AGCTGTGCAT CCTGCTGCAC 600 

GCCTTCCGCA TCCGGGCCGT GACCATCGAC CGCGTGACCA GCTACCTGAA CGCCACGTGA 660 

35 <210> 9 

<211> 188 

<212> amino acid 

<400> 9 

4 0 Met Ala Leu Thr Phe Ala Leu Leu Val Ala Leu Leu Val Leu Ser Cys 
15 10 15 

Lys Ser Ser Cys Ser Val Gly Cys Asp Leu Pro Gin Thr His Ser Leu 
20 25 30 


45 


Gly Ser Arg Arg Thr Leu Met Leu Leu Ala Gin Met Arg Arg lie Ser 
35 40 45 


Leu Phe Ser Cys Leu Lys Asp Arg His Asp Phe Gly Phe Pro Gin Glu 

50 50 55 60 

Glu Phe Gly Asn Gin Phe Gin Lys Ala Glu Thr lie Pro Val Leu His 

65 70 75 80 
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Glu Met lie Gin Gin He Phe Asn Leu Phe Ser Thr Lys Asp Ser Ser 
85 90 95 

Ala Ala Trp Asp Glu Thr Leu Leu Asp Lys Phe Tyr Thr Glu Leu Tvr 
100 105 110 

Gin Gin Leu Asn Asp Leu Glu Ala Cys Val He Gin Gly Val Gly Val 
115 120 125 

Thr Glu Thr Pro Leu Met Lys Glu Asp Ser He Leu Ala Val Arg Lys 
130 135 140 V y 

Tyr Phe Gin Arg He Thr Leu Tyr Leu Lys Glu Lys Lys Tyr Ser Pro 
145 150 155 i 60 

Cys Ala Trp Glu Val Val Arg Ala Glu He Met Arg .Ser Phe Ser Leu 
165 170 * i7 5 

Ser Thr Asn Leu Gin Glu Ser Leu Arg Ser Lys Glu 
180 i 8 5 

<210> 10 

<211> 567 

<212> nucleic acid 

<400> 10 


<210> H 

<211> 567 

<212> nucleic acid 

<400> 11 


60 
120 
180 
240 


CCTTTGCT TT ACTGGTGGCC CTCCTGGTGC TCAGCTGCAA GTCAAGCTGC 
ISSI GGGCT GTGATCTGCC TCAAACCCAC AGCCTGGGTA GCAGGAGGAC CT TGATGCT C 
ScCC^r? Ir^SS^ CTCTCTTTTC TCCTGCTTGA AGGACAGACA TGACTTTGGA 

™™^ C ~22 AGGAGTT TGG CAACCAGTTC CAAAAGGCTG AAACCATCCC TGTCCTCCAT 24 0 

rAr A r^ A ^ AGCAGATCTT CAATCTCTTC AGCACAAAGG ACTCATCTGC TGCTTGGGAT 300 

^^ CTCC TAGACAAATT CTACACTGAA CTCTACCAGC AG CT G AATGA CCTGGAAGCC 360 

GCTGTGAGGA *lrir^ G GGTGACAGAG ACTCCCCTGA TGAAGGAGGA CTCCAT^C^G «S 

TfTrrrTrrr AATACTTCCA AAGAATCACT CTCTATCTGA AAGAGAAGAA ATACAGCCCT 4 80 

ffiSSS SgT TC ATGAGATCTT tttctttgtc aacaaacttg t?o 


rlcGTG^rr rrrlrr^l GCTGGTGGCC CTCCTGGTGC TCAGCTGCAA GAGCAGCTGC 60 

GCGACCTGCC CCAGACCCAC AGCCTGGGGA GCCGGCGGAC CCTGATGCTG 120 

S GGGGCAGA TGCGGCGGAT CAGCCTGTTC AGCTGCCTGA AGGACCGGCA CGACTTCGGG llo 

SSSS AGGAG TTCGG CAACCAGTTC CAGAAGGCCG AGACCATCCC CG^GCTGcSc 240 

AGCAGATCTT CAACCTGTTC AGCACCAAGG ACAGCAGCG C CGCCTGGGAC 300 

GAGACCCTGC TGGACAAGTT CTACACCGAG CTCTACCAGC AGCTGAACGA CCTGTAfirrr Hn 

Zrrrirr™ 0 AGGGGG * GGG GGTGACCGAG ACCCCCCTGA TGAAGGAGGA CAGCATCCTG 420 

?GCG?CTC? A a™^? GCGGATCACC CTGTACCTGA AGGAGAAGAA GTACTCCCCC 480 

£gcagagcc A gcggS gcag?ca ATC atgcggagct tcagcctgag caccaacctg 540° 

567 


95W7676A3 IA> 


WO 99/47678 
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7 


<210> 12 

<211> 567 

<212> nucleic acid 

<223> "Y" stands for C or T; "R" stands for A or G; 

5 "W M stands for T or A; "S" stands for C or G; "N" 
stands for any base. 
<400> 12 

ATGGCNYTNA CNTTYGCNYT NYTNGTNGCN YTNYTNGTNY TNWSNTGYAA RWSNWSNTGY 60 

10 WSNGTNGGNT GYGAYYTNCC NCARACNCAY WSNYTNGGNW SNMGNMGNAC NYTNATGYTN 120 

YTNGCNCARA TGMGNMGNAT HWSNYTNTTY WSNTGYYTNA ARGAYMGNCA YGAYTTYGGN 180 

TTYCCNCARG ARGARTTYGG NAAYCARTTY CARAARGCNG ARACNAT HCC NGTNYTNCAY 240 

GARATGATHC ARCARATHTT YAAYYTNTTY WSNACNAARG AYWSNWSNGC NGCNTGGGAY 300 

GARACNYTNY TNGAYAARTT YTAYACNGAR YTNTAYCARC ARYTNAAYGA YYTNGARGCN 360 

15 TGYGTNATHC ARGGNGTNGG NGTNACNGAR ACNCCNYTNA TGAARGARGA YWSNATHYTN 420 

GCNGTNMGNA ARTAYTTYCA RMGNATHACN YTNTAYYTNA ARGARAARAA RTAYWSNCCN 4 80 

TGYGCNTGGG ARGTNGTNMG NGCNGARATH ATGMGNWS NT TYWSNYTNWS NACNAAYYTN 54 0 

CARGARWSNY TNMGNWSNAA RGARTRR 567 

20 <210> 13 
<211> 191 
<212> nucleic acid 

<400> 13 

25 GGGTGGCATC CCTGTGACCC CTCCCCAGTG CCTCTCCTGG CCCTGGAAGT TGCCACTCCA 60 
GTGCCCACCA GCCTTGTCCT AATAAAATTA AGTTGCATCA TTTTGTCTGA CTAGGTGTCC 120 
TTCTATAATA TTATGGGGTG GAGGGGGGTG GTATGGAGCA AGGGGCAAGT TGGGAAGACA 180 
ACCTGTAGGG C 191 

30 <210> 14 
<211> 58 
<212> nucleic acid 

<400> 14 

35 AAGCTTACTC AACACAATAA CAAACTTACT TACAATCTTA ATTAACAGGC CACCATGG 58 

<210> 15 
<211> 15 
<212> nucleic acid 

40 <400> 15 

CAGGTAAGTG TCTTC 


15 


BNSDOCID: <WO 9947679A3 IA: 


WO 99/47678 


PCT/US99/05394 


<210> 16 

<211> 30 

<212> nucleic acid 

<400> 16 

TACTAACGGT TCTTTTTTTC TCTTCACAGG 


<210> 
<211> 
10 <212> 
<400> 


17 
271 

nucleic acid 
17 


15 


20 


SEE EE! j™ «sss sszss ssass 

s=i ~ si F^- »~ = 


<210> 
<211> 
<212> 
<400> 


CCGGCCACAG 
2 5 GGCATCGCCA 
GAACAGTTCG 
ACCGGCTTCC 
GCAGGTAGCC 
CTCGGCAGGA 
30 CCAGTCCCTT 
GGCCAGCCAC 
GGTCTTGACA 
GCAGCCGATT 
AGAACCTGCG 
35 ATCAGATCTT 
TTTGCAGGGC 
TACATAACTT 
GTCAATAATG 
GGTGGAGTAT 
4 0 TACGCCCCCT 
GACCTTATGG 
GGTGATGCGG 
TCCAAGTCTC 
CTTTCCAAAA 

4 5 GTGGGAGGTC 

TCCACGCTGT 
ACGGTGCATT 
TCCTTCCCCT 
TAGCAGTAAT 

5 0 GCCTCCTCCT 

CCGTGGCCAC 
GGGCCGTCAG 
CTGAAGAGAT 
TACCATTGGA 
5 5 CTAATGGGAG 
GTATTTATGA 
TGATGGATCC 


18 

5686 

nucleic acid 
18 


TCGATGAATC 
TGCGTCACGA 
GCTGGCGCGA 
ATCCGAGTAC 
GGATCAAGCG 
GCAAGGTGAG 
CCCGCTTCAG 
GATAGCCGCG 
AAAAGAACCG 
GTCTGTTGTG 
TGCAATCCAT 
GATCCCCTGC 
TTCCCAACCT 
ACGGTAAATG 
ACGTATGTTC 
TTACGGTAAA 
ATTGACGTCA 
GACTTTCCTA 
TTTTGGCAGT 
CACCCCATTG 
TGTCGTAACA 
TATATAAGCA 
TTTGACCTCC 
GGAACGCGGA 
GCTATTCTGC 
ACTAACGGTT 
TGTGGCTACC 
TCCAGACCCA 
CAACATGCTC 
TGATCATGAA 
ATTAACCAAG 
TTGCCTGGCC 
AGACTTGAAG 
TAAGAGGCAG 


CAGAAAAGCG 
CGAGATCCTC 
GCCCCTGATG 
GTGCTCGCTC 
TATGCAGCCG 
ATGACAGGAG 
TGACAACGTC 
CTGCCTCGTC 
GGCGCCCCTG 
CCCAGTCATA 
CTTGTTCAAT 
GCCATCAGAT 
TACCAGAGGG 
GCCCGCCTGG 
CCATAGTAAC 
CTGCCCACTT 
ATGACGGTAA 
CTTGGCAGTA 
ACATCAATGG 
ACGTCAATGG 
ACTCCGCCCC 
GAGCTCGTTT 
ATAGAAGACA 
TTCCCCGTGT 
TCAACCTTCC 
CTTTTTTTCT 
CTGGTCCTCC 
GGAATGTTCC 
CAGAAGG CCA 
GATATCACAA 
AATGAGAGTT 
TCCAGAAAGA 
ATGTACCAGG 
ATCTTTCTAG 


GCCATTTTCC 
GCCGTCGGGC 
CTCTTCGTCC 
GATGCGATGT 
CCGCATTGCA 
ATCCTGCCCC 
GAGCACAGCT 
CTGCAGTTCA 
CGCTGACAGC 
GCCGAATAGC 
CATGCGAAAC 
CCTTGGCGGC 
CGAATTCGAG 
CTGACCGCCC 
GCCAATAGGG 
GGCAGTACAT 
ATGGCCCGCC 
CATCTACGTA 
GCGTGGATAG 
GAGTTTGTTT 
ATTGACGCAA 
AGTGAACCGT 
CCGGGACCGA 
TAATTAACAG 
TATCAGAAAC 
CTTCACAGGC 
TGGACCACCT 
CATGCCTTCA 
GACAAACTCT 
AAGATAAAAC 
GCCTAAATTC 
CCTCTTTTAT 
TGGAGTTCAA 
ATCAAAACAT 


ACCATGATAT 
ATGCGCGCCT 
AGATCATCCT 
TTCGCTTGGT 
TCAGCCATGA 
GGCACTTCGC 
GCGCAAGGAA 
TTCAGGGCAC 
CGGAACACGG 
CTCTCCACCC 
GATCCTCATC 
AAGAAAGCCA 
CTTGCATGCC 
AACGACCCCC 
ACTTTCCATT 
CAAGTGTATC 
TGGCATTATG 
TTAGTCATCG 
CGGTTTGACT 
TGGCACCAAA 
ATGGGCGGTA 
CAGATCGCCT 
TCCAGCCTCC 
GTAAGTGTCT 
TGCAGTATCT 
CACCATGGGT 
CAGTTTGGCC 
CCACTCCCAA 
AGAATTTTAC 
CAGCACAGTG 
CAGAGAGACC 
GATGGCCCTG 
GACCATGAAT 
GCTGGCAGTT 


TCGGCAAGCA 
TGAGCCTGGC 
G AT CGACAAG 
GGTCGAATGG 
TGGATACTTT 
CCAATAGCAG 
CGCCCGTCGT 
CGGACAGGTC 
CGGCAT CAGA 
AAGCGGCCGG 
CTGTCTCTTG 
TCCAGTTTAC 
TGCAGGTCGT 
GCCCATTGAC 
GACGTCAATG 
ATATGCCAAG 
CCCAGTACAT 
CTATTACCAT 
CACGGGGATT 
ATCAACGGGA 
GGCGTGTACG 
GGAGACGCCA 
GCGGCCGGGA 
TCCTCCTGTT 
GTATTTTTGC 
CCAGCGCGCA 
AGAAACCTCC 
AACCTGCTGA 
CCTTGCACTT 
GAGGCCTGTT 
TCTTTCATAA 
TGCCTTAGTA 
GCAAAGCTTC 
ATTGATGAGC 


60 
120 
180 
240 
271 


60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 


30 
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9 


10 


15 


20 


25 


30 


35 


40 


45 


50 


55 


60 


TGATGCAGGC 
CGGATTTTTA 
CAGTGACTAT 
GAATTCTGCA 
GGAAGTTGCC 
GTCT G ACT AG 
GCAAGTTGGG 
TTACGGTAAA 
TGACGTATGT 
ATTTACGGTA 
CTATTGACGT 
GGGACTTTCC 
GGTTTTGGCA 
TCCACCCCAT 
AATGTCGTAA 
TCTATATAAG 
GTTTTGACCT 
TTGGAACGCG 
CTGCTATTCT 
TACTAACGGT 
TCTCTTGGTT 
AAGATGTTTA 
T C AC CTGTG A 
TCTTAGGCTC 
ACACCTGTCA 
AAGATGGAAT 
TTCTAAGATG 
TC AGT ACT G A 
TGACGTGCGG 
AGTACTCAGT 
TTG AGGT CAT 
T CAT C AGGG A 
ATTCTCGGCA 
ACTTCTCCCT 
GAGTCTTCAC 
TGCGGGCCCA 
GTTAGACGCG 
CCCAGTGCCT 
AAAATTAAGT 
GGGGGTGGTA 
CCGGTACCAG 
CATAGCTGTT 
GAAGCATAAA 
TGCGCTCACT 
GCCAACGCGC 
ACTCGCTGCG 
TACGGTTATC 
AAAAGGCCAG 
CTGACGAGCA 
AAAG AT AC C A 
CGCTTACCGG 
CACGCTGTAG 
' AACCCCCCGT 
CGGTAAGACA 
GGTATGTAGG 
GG AC AGT ATT 
GCTCTTGATC 
AGATTACGCG 
ACGCTCAGAA 
CGATACCGTA 
CACGGGTAGC 


CCTGAATTTC 
TAAAACTAAA 
TGATAGAGTG 
GGAATTGGGT 
ACT CC AGT G C 
GTGTCCTTCT 
AAGACAACCT 
TGGCCCGCCT 
T CC CAT AGT A 
AACTGCCCAC 
CAATGACGGT 
TACTTGGCAG 
GTACATCAAT 
TGACGTCAAT 
CAACTCCGCC 
CAGAGCTCGT 
CCATAGAAGA 
GATTCCCCGT 
GCTCAACCTT 
TCTTTTTTTC 
TTCCCTGGTT 
TGTCGTAGAA 
CACCCCTGAA 
TGGCAAAACC 
CAAAGGAGGC 
TTGGTCCACT 
CGAGGCCAAG 
TTTGACATTC 
AGCTGCTACA 
GGAGTGCCAG 
GGTGGATGCC 
CAT C ATCAAA 
GGTGGAGGTC 
G AC ATT CT GC 
GGACAAGACC 
GGACCGCTAC 
CTAGAAAAGC 
CTCCTGGCCC 
TGCATCATTT 
TGGAGCAAGG 
CTTTTGTTCC 
TCCTGTGTGA 
GTGTAAAGCC 
GCCCGCTTTC 
GGGGAGAGGC 
CTCGGTCGTT 
CACAGAATCA 
GAACCGTAAA 
T C AC AAAAAT 
GGCGTTTCCC 
ATACCTGTCC 
GTATCTCAGT 
TCAGCCCGAC 
CGACTTATCG 
CGGTGCTACA 
TGGTATCTGC 
CGGCAAACAA 
CAGAAAAAAA 
GAACTCGTCA 
AAGCACGAGG 
CAACGCTATG 


AACAGTGAGA 
ATCAAGCTCT 
ATGAGCTATC 
GGCATCCCTG 
CCACCAGCCT 
ATAATATTAT 
GTAGGGCTCG 
GGCTGACCGC 
ACGCCAATAG 
TTGGCAGTAC 
AAATGGCCCG 
TACATCTACG 
GGGCGTGGAT 
GGGAGTTTGT 
CCATTGACGC 
TTAGTGAACC 
CACCGGGACC 
GTTAATTAAC 
CCTATCAGAA 
TCTTCACAGG 
TTTCTGGCAT 
TTGGATTGGT 
GAAGATGGTA 
CTGACCATCC 
GAGGTTCTAA 
GATATTTTAA 
AATTATTCTG 
AGTGTCAAAA 
CTCTCTGCAG 
GAGGACAGTG 
GTTCACAAGC 
CCTGACCCAC 
AGCTGGGAGT 
GTTCAGGTCC 
TCAGCCACGG 
TATAGCTCAT 
CGAATTCTGC 
TGGAAGTTGC 
TGTCTGACTA 
GGCAAGTTGG 
CTTTAGTGAG 
AATTGTTATC 
TGGGGTGCCT 
CAGTCGGGAA 
GGTTTGCGTA 
CGGCTGCGGC 
GGGGATAACG 
AAGGCCGCGT 
CGACGCTCAA 
CCTGGAAGCT 
GCCTTTCTCC 
TCGGTGTAGG 
CGCTGCGCCT 
CCACTGGCAG 
GAGTTCTTGA 
GCTCTGCTGA 
ACCACCGCTG 
GGATCTCAAG 
AGAAGGCGAT 
AAGCGGTCAG 
TCCTGATAGC 


CTGTGCCACA 
GCATACTTCT 
TGAATGCTTC 
TGACCCCTCC 
TGTCCTAATA 
GGGGTGGAGG 
AGGGGGGGCC 
CCAACGACCC 
GGACTTTCCA 
ATCAAGTGTA 
CCTGGCATTA 
TATTAGTCAT 
AGCGGTTTGA 
TTTGGCACCA 
AAATGGGCGG 
GTCAGATCGC 
GAT CCAGCCT 
AGGTAAGTGT 
ACTGCAGTAT 
CTTAAGTCAT 
CTCCCCTCGT 
AT CCGGAT GC 
TCACCTGGAC 
AAGT C AAAG A 
GCCATTCGCT 
AGGACCAGAA 
GACGTTTCAC 
GCAGCAGAGG 
AGAGAGTCAG 
CCTGCCCAGC 
TCAAGTATGA 
CCAAGAACTT 
ACCCTGACAC 
AGGGCAAGAG 
TCATCTGCCG 
CTTGGAGCGA 
AGGAATTGGG 
C ACT CC AGTG 
GGTGTCCTTC 
GAAGACAACC 
GGTTAATTTC 
CG CT CACAAT 
AATGAGTGAG 
ACCTGTCGTG 
TTGGGCGCTC 
GAGCGGTATC 
CAGGAAAGAA 
TGCTGGCGTT 
GTCAGAGGTG 
CCCTCGTGCG 
CTTCGGGAAG 
TCGTTCGCTC 
TATCCGGTAA 
CAGCCACTGG 
AGTGGTGGCC 
AGCCAGTTAC 
GTAGCGGTGG 
AAGATCCTTT 
AGAAGGCGAT 
CCCATTCGCC 
GGTCCGCCAC 


AAAATCCTCC 
TCATGCTTTC 
CTAACAATTC 
CCAGTGCCTC 
AAATTAAGTT 
GGGGTGGTAT 
CGGTACGGTC 
CCGCCCATTG 
TTGACGTCAA 
TCATATGCCA 
TGCCCAGTAC 
CGCTATTACC 
CTCACGGGGA 
AAATCAACGG 
TAGGCGTGTA 
CTGGAGACGC 
CCGCGGCCGG 
CTTCCTCCTG 
CTGTATTTTT 
GGGTCACCAG 
GGCCATATGG 
CCCTGGAGAA 
CTTGGACCAG 
GTTTGGAGAT 
CCTGCTGCTT 
AGAACCCAAA 
CTGCTGGTGG 
CTCTTCTGAC 
AGGGGACAAC 
TGCTGAGGAG 
AAACTACACC 
GCAGCTGAAG 
CTGGAGTACT 
CAAGAGAGAA 
C AAAAAT GCC 
ATGGGCATCT 
TGGCATCCCT 
CCCACCAGCC 
TATAATATTA 
TGTAGGGCTC 
GAGCTTGGCG 
TCCACACAAC 
CTAACTCACA 
CCAGCTGCAT 
TTCCGCTTCC 
AGCTC ACT C A 
CATGTGAGCA 
T TT C CAT AGG 
GCGAAACCCG 
CTCTCCTGTT 
CGTGGCGCTT 
CAAGCTGGGC 
CTATCGTCTT 
TAACAGGATT 
TAACTACGGC 
CTTCGGAAAA 
TTTTTTTGTT 
GATCTTTTCT 
GCGCTGCGAA 
GCCAAGCTCT 
ACCCAG 


CTTGAAGAAC 
AGAATTCGGG 
TAGAAAAGCC 
TCCTGGCCCT 
GCATCATTTT 
GGAGCAAGGG 
GTTACATAAC 
ACGTCAATAA 
TGGGTGGAGT 
AGTACGCCCC 
ATGACCTTAT 
ATGGTGATGC 
TTTCCAAGTC 
GACTTTCCAA 
CGGTGGGAGG 
CATCCACGCT 
GAACGGTGCA 
TTTCCTTCCC 
GCTAGAATTG 
CAGTTCjGTCA 
GAACTGAAGA 
ATGGTGGTCC 
AGCAGT G AGG 
GCTGGCCAGT 
CACAAAAAGG 
AATAAGACCT 
CTGACGACAA 
CCCCAAGGGG 
AAGGAGTATG 
AGTCTGCCCA 
AGCAGCTTCT 
CCATTAAAGA 
CCACATTCCT 
AAGAAAGATA 
AGCATTAGCG 
GTGCCCTGCA 
GTGACCCCTC 
TTGTCCTAAT 
TGGGGTGGAG 
GAGGGGGGGC . 
TAATCATGGT 
ATACGAGCCG 
TTAATTGCGT 
TAATGAATCG 
TCGCTCACTG 
AAGGCGGTAA 
AAAGGCCAGC 
CTCCGCCCCC 
ACAGGACTAT 
CCGACCCTGC 
TCTCATAGCT 
TGTGTGCACG 
GAGTCCAACC 
AGCAGAGCGA 
TACACTAGAA 
AG AGTT GGT A 
TGCAAGCAGC 
ACGGGGTCTG 
TCGGGAGCGG 
TCAGCAATAT 


2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 

3780 

3840 

3900 

3960 

4020 

4080 

4140 

4200 

4260 

4320 

4380 

4440 

4500 

4560 

4620 

4680 

4740 

4800 

4860 

4920 

4980 

5040 

5100 

5160 

5220 

5280 

5340 

5400 

5460 

5520 

5580 

5640 

5686 
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10 


15 


20 


25 


30 


35 


40 


45 


50 


55 


60 


<210> 
<211> 
<212> 
<400> 

GATCCATGGC 
CAGCCTGTTC 
TGACAGTCCT 
ATTTTGGATT 
TTGTGCTAAG 
CTACTTGGAA 
AT CT CAAAGC 
TGAGGACATA 
CCTGGGAGGT 
CAAGACTGAG 
CCTGTGACCC 
GCCTTGTCCT 
TTATGGGGTG 
GGCTCGAGGG 
TGGCGTAATC 
ACAACATACG 
TCACATTAAT 
TGCATTAATG 
CTTCCTCGCT 
ACTCAAAGGC 
GAGCAAAAGG 
ATAGGCTCCG 
ACCCGACAGG 
CTGTTCCGAC 
CGCTTTCTCA 
TGGGCTGTGT 
GTCTTGAGTC 
GGATTAGCAG 
ACGGCTACAC 
GAAAAAGAGT 
TTGTTTGCAA 
TTTCTACGGG 
. GCGAATCGGG 
GCTCTTCAGC 
GCCGGCCACA 
AGGCATCGCC 
CGAACAGTTC 
GACCGGCTTC 
GGCAGGTAGC 
TCTCGGCAGG 
GCCAGTCCCT 
TGGCCAGCCA 
CGGTCTTGAC 
AGCAGCCGAT 
GAGAACCTGC 
GATCAGATCT 
CTTTGCAGGG 
TTACATAACT 
CGTCAATAAT 
GGGTGGAGTA 
GTACGCCCCC 
TGACCTTATG 
TGGTGATGCG 
TTCCAAGTCT 
ACTTTCCAAA 
GGTGGGAGGT 
ATCCACGCTG 
CTAGAG 


19 

3426 

nucleic acid 
19 


TAGGCTCTGT 
TCTAGGATGT 
GGAAGAAATG 
CCCCTTGGAG 
AGATCTTACC 
TGCAACTCTC 
CTGTGTGATG 
CTTCCACAGG 
GATCAGAGCA 
TGAGGAGAAG 
CTCCCCAGTG 
AATAAAATTA 
GAGGGGGGTG 
GGGGCCCGGT 
ATGGTCATAG 
AGCCGGAAGC 
TGCGTTGCGC 
AATCGGCCAA 
CACTGACTCG 
GGTAATACGG 
CCAGCAAAAG 
CCCCCCTGAC 
ACTATAAAGA 
CCTGCCGCTT 
TAGCTCACGC 
GCACGAACCC 
CAACCCGGTA 
AGCGAGGTAT 
TAGAAGGACA 
TGGTAGCTCT 
GCAGCAGATT 
GTCTGACGCT 
AGCGGCGATA 
AATATCACGG 
GTCGATGAAT 
ATGCGTCACG 
GGCTGGCGCG 
CAT CCG AGT A 
CGGAT CAAGC 
AGCAAGGTGA 
TCCCGCTTCA 
CGATAGCCGC 
AAAAAGAACC 
TGTCTGTTGT 
GTGCAATCCA 
TGATCCCCTG 
CTTCCCAACC 
TACGGTAAAT 
GACGTATGTT 
TTTACGGTAA 
TATTGACGTC 
GGACTTTCCT 
GTTTTGGCAG 
CCACCCCATT 
ATGTCGTAAC 
CTATATAAGC 
TTTTGACCTC 


GCTTTCCTCA 
GACCTGCCTC 
AGAAGACTCC 
AAGGTGGATA 
CAGCAGATTT 
CTAGACT CAT 
CAGGAACCTC 
ATCACTGTGT 
GAAGTCTGGA 
GAGTGATCTA 
CCTCTCCTGG 
AGTTGCATCA 
GTATGGAGCA 
ACCAGCTTTT 
CTGTTTCCTG 
ATAAAGTGTA 
TCACTGCCCG 
CGCGCGGGGA 
CTGCGCTCGG 
TTATCCACAG 
GCCAGGAACC 
GAGCATCACA 
TACCAGGCGT 
ACCGGATACC 
TGTAGGTATC 
CCCGTTCAGC 
AGACACGACT 
GTAGGCGGTG 
GTATTTGGTA 
TGATCCGGCA 
ACGCGCAGAA 
CAGAAGAACT 
CCGTAAAGCA 
GTAGCCAACG 
CCAGAAAAGC 
ACGAGATCCT 
AGCCCCTGAT 
CGTGCTCGCT 
GTATGCAGCC 
GATGACAGGA 
GTGACAACGT 
GCTGCCTCGT 
GGGCGCCCCT 
GCCCAGTCAT 
TCTTGTTCAA 
CGCCATCAGA 
TTACCAGAGG 
GGCCCGCCTG 
CCCATAGTAA 
ACTGCCCACT 
AATGACGGTA 
ACTTGGCAGT 
TACATCAATG 
GACGTCAATG 
AACTCCGCCC 
AGAGCTCGTT 
CATAGAAGAC 


TGATCCTAGT 
ACACTTATAA 
CCCCTCTTTC 
ACCAACAGAT 
TGAACCTCTT 
TCTGCAATGA 
CTCTGACCCA 
ACCTGAGAAA 
GAGCCCTCTC 
GAAAGCCGAA 
CCCTGGAAGT 
TTTTGTCTGA 
AGGGGCCCAA 
GTTCCCTTTA 
TGTGAAATTG 
AAGCCTGGGG 
CTTTCCAGTC 
GAGGCGGTTT 
TCGTTCGGCT 
AATCAGGGGA 
GTAAAAAGGC 
AAAATCGACG 
TTCCCCCTGG 
TGTCCGCCTT 
TCAGTTCGGT 
CCGACCGCTG 
TATCGCCACT 
CTACAGAGTT 
TCTGCGCTCT 
AACAAACCAC 
AAAAAGGATC 
CGTCAAGAAG 
CGAGGAAGCG 
CTATGTCCTG 
GGCCATTTTC 
CGCCGTCGGG 
GCTCTTCGTC 
CGATGCGATG 
GCCGCATTGC 
GATCCTGCCC 
CGAGCACAGC 
CCTGCAGTTC 
GCGCTGACAG 
AGCCGAATAG 
TCATGCGAAA 
TCCTTGGCGG 
GCGAATTCGA 
GCTGACCGCC 
CGCCAATAGG 
TGGCAGTACA 
AATGGCCCGC 
ACATCTACGT 
GGCGTGGATA 
GGAGTTTGTT 
CATTGACGCA 
TAGTGAACCG 
ACCGGGACCG 


AATGATGAGC 
CCTCGGGAAC 
CTGCCTGAAG 
CCAGAAGGCT 
CACATCAAAA 
CCTCCATCAG- 
GGAAGACTCC 
GAAGAAACAC 
TTCCTCAACC 
TTCTGCAGGA 
TGCCACTCCA 
CTAGGTGTCC 
GTTGGGAAGA 
GTGAGGGTTA 
TTATCCGCTC 
TGCCTAATGA 
GGGAAACCTG 
GCGTATTGGG 
GCGGCGAGCG 
TAACG CAGG A 
CGCGTTGCTG 
CTCAAGTCAG 
AAGCTCCCTC 
TCTCCCTTCG 
GTAGGTCGTT 
CGCCTTATCC 
GGCAGCAGCC 
CTTGAAGTGG 
GCTGAAGCCA 
CGCTGGTAGC 
TCAAGAAGAT 
GCGATAGAAG 
GTCAGCCCAT 
ATAGCGGTCC 
CACCATGATA 
CATGCGCGCC 
CAGAT CATCC 
TTTCGCTTGG 
ATCAGCCATG 
CGGCACTTCG 
TGCGCAAGGA 
ATTCAGGGCA 
CCGGAACACG 
CCTCTCCACC 
CGATCCTCAT 
CAAGAAAGCC 
GCTT GCATGC 
CAACGACCCC 
GACTTTCCAT 
T CAAGTGT AT 
CTGGCATTAT 
ATT AGT CATC 
GCGGTTTGAC 
TTGGCACCAA 
AATGGGCGGT 
TCAGATCGCC 
ATCCAGCCTC 


TACTACTGGT 
AAGAGGGCCT 
G AC AG G AAGG 
CAAGCCATCC 
GACTTGTCTG 
CAGCTCAATG 
CTGCTGGCTG 
AGCCTCTGTG 
AACTTGCTGG 
ATTTGGCATC 
GTGCCCACCA 
TTCTATAATA 
CAACCTGTAG 
ATTTCGAGCT 
ACAAT TCCAC 
GTGAGCTAAC 
TCGTGCCAGC 
CGCTCTTCCG 
GTATCAGCTC 
AAGAACATGT 
GCGTTTTTCC 
AGGTGGCGAA 
GTGCGCTCTC 
GGAAGCGTGG 
CGCTCCAAGC 
GGTAACTATC 
ACTGGTAACA 
TGGCCTAACT 
GTTACCTTCG 
GGTGGTTTTT 
CCTTTGATCT 
GCGATGCGCT 
TCGCCGCCAA 
GCCACACCCA 
TTCGG CAAGC 
TTGAGCCTGG 
TGATCGACAA 
TGGTCGAATG 
ATGGATACTT 
CCCAATAGCA 
ACGCCCGTCG 
CCGGACAGGT 
GCGGCATCAG 
CAAGCGGCCG 
CCTGTCTCTT 
ATCCAGTTTA 
CTGCAGGTCG 
CGCCCATTGA 
TGACGTCAAT 
CATATGCCAA 
GCCCAGTACA 
GCTATTACCA 
TCACGGGGAT 
AAT CAACGGG 
AGGCGTGTAC 
TGGAGACGCC 
CGGACTCTAG 
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<210> 
<211> 
<212> 
<400> 

AGCTTCGAGG 
TTGGCGTAAT 
CACAACATAC 
CTCACATTAA 
CTGCATTAAT 
GCTTCCTCGC 
CACTCAAAGG 
TGAGCAAAAG 
CATAGGCTCC 
AACCCGACAG 
CCTGTTCCGA 
GCGCTTTCTC 
CTGGGCTGTG 
CGTCTTGAGT 
AGGATTAGCA 
TACGGCTACA 
GGAAAAAGAG 
TTTGTTTGCA 
TTTTCTACGG 
TGCGAATCGG 
AGCTCTTCAG 
AGCCGGCCAC 
C AG GC AT CG C 
GCGAACAGTT 
AGACCGGCTT 
GGGCAGGTAG 
TTCTCGGCAG 
AGCCAGTCCC 
GTGGCCAGCC 
TCGGTCTTGA 
GAGCAGCCGA 
GGAGAACCTG 
TGATCAGATC 
ACTTTGCAGG 
GCTGTCCATA 
AGCTTGCATG 
CCAACGACCC 
GGACTTTCCA 
ATCAAGTGTA 
C CTGGCATT A 
TATTAGTCAT 
AGCGGTTTGA 
TTTGGCACCA 
• AAATGGGCGG 
GTCAGATCGC 
GATCCAGCCT 
AGGTAAGTGT 
ACTGCAGTAT 
GCCACGATGT 
CTCAGTTTGG 
AACCTGCTGA 
TCCTGCACTG 
AAGACCTGTT 
TCTTCCACAA 
TGCCTTGGTA 
GCAGCACTTC 
AT CG ATGAGC 
GTGGGAGAAG 
AGCACCCGCG 


20 

5966 

nucleic acid 
20 


GGGGGCCCGG 
C ATGGT CAT A 
GAGCCGGAAG 
TTGCGTTGCG 
GAATCGGCCA 
TCACTGACTC 
CGGTAATACG 
GCCAGCAAAA 
GCCCCCCTGA 
GACTATAAAG 
CCCTGCCGCT 
ATAGCTCACG 
TGCACGAACC 
CCAACCCGGT 
GAGCGAGGTA 
CTAGAAGGAC 
TTGGTAGCTC 
AGCAGCAGAT 
GGTCTGACGC 
GAGCGGCGAT 
CAATATCACG 
AGTCGATGAA 
CATGCGTCAC 
CGGCTGGCGC 
CCATCCGAGT 
CCGGATCAAG 
GAGCAAGGTG 
TTCCCGCTTC 
ACGATAGCCG 
CAAAAAGAAC 
TTGTCTGTTG 
CGTGCAATCC 
TTGATCCCCT 
GCTTCCCAAC 
AAACCGCCCA 
CCTGCAGGTC 
CCGCCCATTG 
TTGACGTCAA 
TCATATGCCA 
TGCCCAGTAC 
CGCTATTACC 
CTCACGGGGA 
AAATCAACGG 
TAGGCGTGTA 
CTGGAGACGC 
CCGCGGCCGG 
CTTCCTCCTG 
CTGTATTTTT 
GTCAATCACG 
CCAGGGTCAT 
AGACCACAGA 
CTGAAGACAT 
TACCACTGGA 
CAAGAGGGAG 
GCATCTATGA 
AGAATCACAA 
TGATGCAGTC 
CAGACCCTTA 
TCGTGACCAT 


TACCAGCTTT 
GCTGTTTCCT 
CATAAAGTGT 
CTCACTGCCC 
ACGCGCGGGG 
GCTGCGCTCG 
GTTATCCACA 
GGCCAGGAAC 
CGAGCATCAC 
ATACCAGGCG 
TACCGGATAC 
CTGTAGGTAT 
CCCCGTTCAG 
AAGACACGAC 
TGTAGGCGGT 
AGTATTTGGT 
TTGATCCGGC 
TACGCGCAGA 
TCAGAAGAAC 
ACCGTAAAGC 
GGTAGCCAAC 
TCCAGAAAAG 
GACGAGATCC 
GAGCCCCTGA 
ACGTGCTCGC 
CGTATGCAGC 
AGATGACAGG 
AGTGACAACG 
CGCTGCCTCG 
CGGGCGCCCC 
TGCCCAGTCA 
ATCTTGTTCA 
GCGCCATCAG 
CTTACCAGAG 
GTCTAGCAAC 
GTTACATAAC 
ACGTCAATAA 
TGGGTGGAGT 
AGTACGCCCC 
ATGACCTTAT 
ATGGTGATGC 
TTTCCAAGTC 
GACTTTCCAA 
CGGTGGGAGG 
CATCCACGCT 
GAACGGTGCA 
TTTCCTTCCC 
GCTAGCAGTA 
CTACCTCCTC 
TCCAGTCTCT 
TGACATGGTG 
CGATCATGAA 
ACTACACAAG 
CTGCCTGCCC 
GGACTTGAAG 
CCATCAGCAG 
TCTGAATCAT 
CAGAGTGAAA 
CAACAGGGTG 


TGTTCCCTTT 
GTGTGAAATT 
AAAGCCTGGG 
GCTTTCCAGT 
AGAGGCGGTT 
GTCGTTCGGC 
GAATCAGGGG 
CGTAAAAAGG 
AAAAAT CGAC 
TTTCCCCCTG 
CTGTCCGCCT 
CTCAGTTCGG 
CCCGACCGCT 
TTATCGCCAC 
GCTACAGAGT 
ATCTGCGCTC 
AAACAAACCA 
AAAAAAGGAT 
TCGTCAAGAA 
ACGAGGAAGC 
GCTATGTCCT 
CGGCCATTTT 
TCGCCGTCGG 
TGCTCTTCGT 
TCGATGCGAT 
CGCCGCATTG 
AGATCCTGCC 
TCGAGCACAG 
TCCTGCAGTT 
TGCGCTGACA 
TAGCCGAATA 
ATCATGCGAA 
ATCCTTGGCG 
GGCGCCCCAG 
TGTTGGGAAG 
TTACGGTAAA 
TGACGTATGT 
ATTTACGGTA 
CTATTGACGT 
GGGACTTTCC 
GGTTTTGGCA 
TCCACCCCAT 
AATGTCGTAA 
TCTATATAAG 
GTTTTGACCT 
TTGGAACGCG 
CTGCTATTCT 
ATACTAACGG 
TTTTTGGCCA 
GGACCTGCCA 
AAGACGGCCA 
G AC AT C ACAC 
AACGAGAGTT 
CCACAGAAGA 
ATGTACCAGA 
ATCATTCTAG 
AATGGCGAGA 
ATGAAGCTCT 
ATGGGCTATC 


AGTGAGGGTT 
GTTATCCGCT 
GTGCCTAATG 
CGGGAAACCT 
TGCGTATTGG 
TGCGGCGAGC 
ATAACGCAGG 
CCGCGTTGCT 
GCTCAAGTCA 
GAAGCTCCCT 
TTCTCCCTTC 
TGTAGGTCGT 
GCGCCTTATC 
TGGCAGCAGC 
TCTTGAAGTG 
TGCTGAAGCC 
CCGCTGGTAG 
CTCAAGAAGA 
GGCGATAGAA 
GGTCAGCCCA 
GATAGCGGTC 
CCACCATGAT 
GCATGCGCGC 
CCAGATCATC 
GTTTCGCTTG 
CATCAGCCAT 
CCGGCACTTC 
CTGCGCAAGG 
CATTCAGGGC 
GCCGGAACAC 
GCCTCTCCAC 
ACGATCCTCA 
GCAAGAAAGC 
CTGGCAATTC 
GGCGGGGCTG 
TGGCCCGCCT 
TCCCATAGTA 
AACTGCCCAC 
CAAT GACGGT 
TACTTGGCAG 
GTACATCAAT 
TGACGTCAAT 
CAACTCCGCC 
CAGAGCTCGT 
CCATAGAAGA 
GATTCCCCGT 
GCTCAACCTT 
TTCTTTTTTT 
CCCTTGCCCT 
GGTGTCTTAG 
GAGAAAAACT 
GGGACCAAAC 
GCCTGGCTAC 
CGTCTTTGAT 
CAGAGTTCCA 
ACAAGGGCAT 
CTCTGCGCCA 
GCATCCTGCT 
TGAGCTCCGC 


AATTTCGAGC 
CACAATTCCA 
AGTGAGCTAA 
GTCGTGCCAG 
GCGCTCTTCC 
GGTATCAGCT 
AAAGAACATG 
GGCGTTTTTC 
GAGGTGGCGA 
CGTGCGCTCT 
GGGAAGCGTG 
TCGCTCCAAG 
CGGTAACTAT 
CACTGGTAAC 
GTGGCCTAAC 
AGTTACCTTC 
CGGTGGTTTT 
TCCTTTGATC 
GGCGATGCGC 
TTCGCCGCCA 
CGCCACACCC 
ATT CGGCAAG 
CTTGAGCCTG 
CTGATCGACA 
GTGGTCGAAT 
GATGGATACT 
GCCCAATAGC 
AACGCCCGTC 
ACCGGACAGG 
GGCGGCATCA 
CCAAGCGGCC 
TCCTGTCTCT 
CATCCAGTTT 
CGGTTCGCTT 
CAGGAATTCG 
GGCTGACCGC 
ACGCCAATAG 
TTGGCAGTAC 
AAATGGCCCG 
TACATCTACG 
GGGCGTGGAT 
GGGAGTTTGT 
CCATTGACGC 
TTAGTGAACC 
CACCGGGACC 
GTTAATTAAC 
CCTATCAGAA 
CTCTTCACAG 
CCTAAACCAC 
CCAGTCCCGA 
GAAACATTAT 
CAGCACATTG 
TAGAGAGACT 
GATGACCCTG 
GGCCATCAAC 
GCTGGTGGCC 
GAAACCTCCT 
TCACGCCTTC 
CTTAACTAGT 
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GAAAGCTCAA 
TAGAGCTCGC 
CTCCCCCGTG 
TGAGGAAATT 
GCAGGACAGC 
CTCTATGGCT 
GCAGGTCGTT 
CCCATTGACG 
ACGTCAATGG 
TATGCCAAGT 
CCAGTACATG 
TATTACCATG 
ACGGGGATTT 
TCAACGGGAC 
GCGTGTACGG 
GAGACGCCAT 
CGGCCGGGAA 
CCTCCTGTTT 
TATTTTTGCT 
CTCAGAAGCT 
TGTGGGAGCT 
GAGAAACAGT 
ACCAGAGACA 
TAGATGCTGG 
TGCTCCACAA 
AGACTTTCCT 
TGCAAAGAAA 
GGGCAGTGAC 
ACTATGAGAA 
CCCTGCCCAT 
CCAGCTTCTT 
CTTTGAAGAA 
ATTCCTACTT 
AGACAGAGGA 
TCCAATGCAA 
GCAGCAAGTG 
ACTGTGCCTT 
CTGGAAGGTG 
CTGAGTAGGT 
TGGGAAGACA 
AGAACCAGCT 


GGCCCTCTGC 
TGATCAGCCT 
CCTTCCTTGA 
GCATCGCATT 
AAGGGGGAGG 
TCTGAGGCGG 
ACATAACTTA 
TCAATAATGA 
GTGGAGTATT 
ACGCCCCCTA 
ACCTTATGGG 
GTGATGCGGT 
CCAAGTCTCC 
TTTCCAAAAT 
TGGGAGGTCT 
CCACGCTGTT 
CGGTGCATTG 
CCTTCCCCTG 
AGCAGTAATA 
AACCATCTCC 
GGAGAAAGAC 
GAACCTCACC 
TGGAGTCATA 
CCAGTACACC 
GAAGGAAAAT 
GAAGTGTGAA 
CATGGACTTG 
ATGTGGAATG 
GTATTCAGTG 
TGAACTGGCG 
CATCAGGGAC 
CTCACAGGTG 
CTCCCTCAAG 
GGGGTGTAAC 
AGGCGGGAAT 
GGCATGTGTT 
CTAGTTGCCA 
CCACTCCCAC 
GTCATTCTAT 
ATAGCAGGCA 
GGGGCTCGAG 


CACAGCGCCC 
CGACTGTGCC 
CCCTGGAAGG 
GTCTGAGTAG 
ATTGGGAAGA 
AAAGAACCAG 
CGGTAAATGG 
CGTATGTTCC 
TACGGTAAAC 
TTGACGTCAA 
ACTTTCCTAC 
TTTGGCAGTA 
ACCCCATTGA 
GTCGTAACAA 
ATATAAGCAG 
TTGACCTCCA 
GAACGCGGAT 
CTATTCTGCT 
CTAACGGTTC 
TGGTTTGCCA 
GTTTATGTTG 
TGTGACACGC 
GGCTCTGGAA 
TGCCACAAAG 
GGAATTTGGT 
GCACCAAATT 
AAGTTCAACA 
GCGTCTCTGT 
TCCTGCCAGG 
TTGGAAGCAC 
AT CATC AAAC 
GAGGTCAGCT 
TTCTTTGTTC 
CAGAAAGGTG 
GTCTGCGTGC 
CCCTGCAGGG 
GCCATCTGTT 
TGTCCTTTCC 
TCTGGGGGGT 
TGCTGGGGAT 
CATGCA 


TCCTCACACA 
TTCTAGTTGC 
TGCCACTCCC 
GTGTCATTCT 
CAATAGCAGG 
CTGGGGCTCG 
CCCGCCTGGC 
CATAGTAACG 
TGCCCACTTG 
TGACGGTAAA 
TTGGCAGTAC 
CATCAATGGG 
CGTCAATGGG 
CTCCGCCCCA 
AGCTCGTTTA 
TAGAAGACAC 
TCCCCGTGTT 
CAACCTTCCT 
TTTTTTTCTC 
TCGTTTTGCT 
TAGAGGTGGA 
CTGAAGAAGA 
AGACCCTGAC 
GAGGCGAGAC 
CCACTGAAAT 
ACTCCGGACG 
TCAAGAGCAG 
CTGCAGAGAA 
AGGATGTCAC 
GGCAGCAGAA 
CAGACCCGCC 
GGGAGTACCC 
GAATCCAGCG 
CGTTCCTCGT 
AAGCTCAGGA 
TCCGATCCTA 
GTTTGCCCCT 
TAATAAAATG 
GGGGTGGGGC 
GCGGTGGGCT 


GATAGGAATT 
CAGCCATCTG 
ACTGTCCTTT 
ATTCTGGGGG 
CATGCTGGGG 
AAATTCGAGC 
TGACCGCCCA 
CCAATAGGGA 
GCAGTACATC 
TGGCCCGCCT 
ATCTACGTAT 
CGTGGATAGC 
AGTTTGTTTT 
TTGACGCAAA 
GTGAACCGTC 
CGGGACCGAT 
AATTAACAGG 
ATCAGAAACT 
TTCACAGGCC 
GGTGTCTCCA 
CTGGACTCCC 
TGACATCACC 
CATCACTGTC 
TCTGAGCCAC 
TTTAAAAAAT 
GTTCACGTGC 
TAGCAGTTCC 
GGTCACACTG 
CTGCCCAACT 
TAAATATGAG 
CAAGAACTTG 
TGACTCCTGG 
CAAGAAAGAA 
AGAGAAGACA 
TCGCTATTAC 
GAGCTCGCTG 
CCCCCGTGCC 
AGGAAATTGC 
AGGACAGCAA 
CTATGGCTTC 


CTGCAGATCC 
TTGTTTGCCC 
CCTAATAAAA 
GTGGGGTGGG 
ATGCGGTGGG 
TTGCATGCCT 
ACGACCCCCG 
CTTTCCATTG 
AAGTGTATCA 
GGCATTATGC 
TAGTCATCGC 
GGTTTGACTC 
GGCACCAAAA 
TGGGCGGTAG 
AGATCGCCTG 
CCAGCCTCCG 
TAAGTGTCTT 
GCAGTATCTG 
ACGATGTGTC 
CTCATGGCCA 
GATGCCCCTG 
TGGACCTCAG 
AAAGAGTTTC 
TCACATCTGC 
TTCAAAAACA 
TCATGGCTGG 
CCTGACTCTC 
GACCAAAGGG 
GCCGAGGAGA 
AACTACAGCA 
CAGATGAAGC 
AGCACTCCCC 
AAGATGAAGG 
TCTACCGAAG 
AAT TCCTCAT 
ATCAGCCTCG 
TTCCTTGACC 
ATCGCATTGT 
GGGGGAGGAT 
TGAGGCGGAA 
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<210> 
<211> 

<212> 
<400> 


21 

3589 

nucleic acid 
21 


CGTTACATAA 
GACGTCAATA 
ATGGGTGGAG 
AAGTACGCCC 
CATGACCTTA 
CATGGTGATG 
ATTTCCAAGT 
GGACTTTCCA 
ACGGTGGGAG 
CCATCCACGC 
GGAACGGTGC 
GTTTCCTTCC 
TGCTAGCAGT 
TTGCTTTACT 
ATCTGCCTCA 
GGAGAATCTC 
AGTTTGGCAA 


CTTACGGTAA 

ATGACGTATG 

TATTTACGGT 

CCTATTGACG 

TGGGACTTTC 

CGGTTTTGGC 

CTCCACCCCA 

AAATGTCGTA 

GTCTATATAA 

TGTTTTGACC 

ATTGGAACGC 

CCTGCTATTC 

AATACTAACG 

GGTGGCCCTC 

AACCCACAGC 

TCTTTTCTCC 

CCAGTTCCAA 


ATGGCCCGCC 
TTCCCATAGT 
AAACTGCCCA 
TCAATGACGG 
CTACTTGGCA 
AGTACATCAA 
TTGACGTCAA 
ACAACTCCGC 
GCAGAGCTCG 
TCCATAGAAG 
GGATTCCCCG 
TGCTCAACCT 
GTTCTTTTTT 
CTGGTGCTCA 
CTGGGTAGCA 
TGCTTGAAGG 
AAGGCTGAAA 


TGGCTGACCG 
■AACGCCAATA 
CTTGGCAGTA 
TAAATGGCCC 
GTACATCTAC 
TGGGCGTGGA 
TGGGAGTTTG 
CCCATTGACG 
TTTAGTGAAC 
ACACCGGGAC 
TGTTAATTAA 
TCCTATCAGA 
TCTCTT C AC A 
GCTGCAAGTC 
GGAGGACCTT 
ACAGACATGA 
CCATCCCTGT 


CCCAACGACC 
GGGACTTTCC 
CATCAAGTGT 
GCCTGGCATT 
GTATTAGTCA 
TAGCGGTTTG 
TTTTGGCACC 
CAAATGGGCG 
CGTCAGATCG 
CGATCCAGCC 
CAGGTAAGTG 
AACTGCAGTA 
GGCCACCATG 
AAGCTGCTCT 
GATGCTCCTG 
CTTTGGATTT 
CCTCCATGAG 


CCCGCCCATT 
AT TG ACGTC A 
ATCATATGCC 
ATGCCCAGTA 
TCGCTATTAC 
ACTCACGGGG 
AAAATCAACG 
GTAGGCGTGT 
CCTGGAGACG 
TCCGCGGCCG 
TCTTCCTCCT 
TCTGTATTTT 
GCCTTGACCT 
GTGGGCTGTG 
GCACAGATGA 
CCCCAGGAGG 
ATGATCCAGC 
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AGATCTTCAA 
ACAAATTCTA 
GGGTGGGGGT 
ACT TC CAAAG 
TTGTCAGAGC 
GAAGTAAGGA 
CCCCTCCCCA 
CCTAATAAAA 
GTGGAGGGGG 
GGGGGCCCGG 
C ATGGT CAT A 
GAGCCGGAAG 
TTGCGTTGCG 
GAATCGGCCA 
TCACTGACTC 
CGGTAATACG 
GCCAGCAAAA 
GCCCCCCTGA 
GACTATAAAG 
CCCTGCCGCT 
ATAGCTCACG 
TGCACGAACC 
CCAACCCGGT 
GAGCGAGGTA 
CTAGAAGGAC 
TTGGTAGCTC 
AGCAGCAGAT 
GGTCTGACGC 
GAGCGGCGAT 
CAATATCACG 
AGTCGATGAA 
CATGCGTCAC 
CGGCTGGCGC 
CCATCCGAGT 
CCGGATCAAG 
GAG CAAGGTG 
TTCCCGCTTC 
ACGATAGCCG 
CAAAAAGAAC 
TTGTCTGTTG 
CGTGCAATCC 
• TTGATCCCCT 
GCTTCCCAAC 


TCTCTTCAGC 
CACTGAACTC 
GACAGAGACT 
AATCACTCTC 
AG AAATC AT G 
ATGAATCTAG 
GTGCCTCTCC 
TTAAGTTGCA 
GTGGTATGGA 
TACCAGCTTT 
GCTGTTTCCT 
CATAAAGTGT 
CTCACTGCCC 
ACGCGCGGGG 
GCTGCGCTCG 
GTTATCCACA 
GGCCAGGAAC 
CGAGCATCAC 
ATACCAGGCG 
TACCGGATAC 
CTGTAGGTAT 
CCCCGTTCAG 
AAGACACGAC 
TGTAGGCGGT 
AGTATTTGGT 
TTGATCCGGC 
TACGCGCAGA 
TCAGAAGAAC 
ACCGTAAAGC 
GGTAGCCAAC 
TCCAGAAAAG 
GACGAGATCC 
GAGCCCCTGA 
ACGTGCTCGC 
CGTATGCAGC 
AGATGACAGG 
AGTGACAACG 
CGCTGCCTCG 
CGGGCGCCCC 
TGCCCAGTCA 
ATCTTGTTCA 
GCGCCATCAG 
CTTACCAGAG 


ACAAAGGACT 
TACCAGCAGC 
CCCCTGATGA 
TATCTGAAAG 
AGATCTTTTT 
AAAAGCCGAA 
TGGCCCTGGA 
TCATTTTGTC 
GCAAGGGGCA 
TGTTCCCTTT 
GTGTGAAATT 
AAAGCCTGGG 
GCTTTCCAGT 
AGAGGCGGTT 
GTCGTTCGGC 
GAATCAGGGG 
CGTAAAAAGG 
AAAAAT CG AC 
TTTCCCCCTG 
CTGTCCGCCT 
CTCAGTTCGG 
CCCGACCGCT 
TTATCGCCAC 
GCTACAGAGT 
ATCTGCGCTC 
AAACAAACCA 
AAAAAAGGAT 
TCGTCAAGAA 
ACGAGGAAGC 
GCTATGTCCT 
CGGCCATTTT 
TCGCCGTCGG 
TGCTCTTCGT 
TCGATGCGAT 
CGCCGCATTG 
AGATCCTGCC 
TCGAGCACAG 
TCCTGCAGTT 
TGCGCTGACA 
TAGCCGAATA 
ATCATGCGAA 
ATCCTTGGCG 
GGCGAATTCG 


CATCTGCTGC 
TGAATGACCT 
AGGAGGACTC 
AGAAGAAATA 
CTTTGTCAAC 
TTCTGCAGGA 
AGTTGCCACT 
TGACTAGGTG 
AGTTGGGAAG 
AGTGAGGGTT 
GTTATCCGCT 
GTGCCTAATG 
CGGGAAACCT 
TGCGTATTGG 
TGCGGCGAGC 
ATAACGCAGG 
CCGCGTTGCT 
GCTCAAGTCA 
GAAGCTCCCT 
TTCTCCCTTC 
TGTAGGTCGT 
GCGCCTTATC 
TGGCAGCAGC 
TCTTGAAGTG 
TGCTGAAGCC 
CCGCTGGTAG 
CTCAAGAAGA 
GGCGATAGAA 
GGTCAGCCCA 
GATAGCGGTC 
CCACCATGAT 
GCATGCGCGC 
CC AGAT CAT C 
GTTTCGCTTG 
CAT CAGCC AT 
CCGGCACTTC 
CTGCGCAAGG 
CATTCAGGGC 
GCCGGAACAC 
GCCTCTCCAC 
ACGATCCTCA 
GCAAGAAAGC 
AGCTTGCATG 


TTGGGATGAG 
GGAAG CCTGT 
CATTCTGGCT 
CAGCCCTTGT 
AAACTTGCAA 
ATTGGGTGGC 
CCAGTGCCCA 
TCCTTCTATA 
ACAACCTGTA 
AATTTCGAGC 
CACAATTCCA 
AGTGAGCTAA 
GTCGTGCCAG 
GCGCTCTTCC 
GGTATCAGCT 
AAAGAACATG 
GGCGTTTTTC 
GAGGTGGCGA 
CGTGCGCTCT 
GGGAAGCGTG 
TCGCTCCAAG 
CGGTAACTAT 
CACTGGTAAC 
GTGGCCTAAC 
AGTTACCTTC 
CGGTGGTTTT 
TCCTTTGATC 
GGCGATGCGC 
TTCGCCGCCA 
CGCCACACCC 
ATTCGGCAAG 
CTTGAGCCTG 
CTGATCGACA 
GTGGTCGAAT 
GATGGATACT 
GCCCAATAGC 
AACGCCCGTC 
ACCGGACAGG 
GGCGGCATCA 
CCAAGCGGCC 
TCCTGTCTCT 
CATCCAGTTT 
CCTGCAGGT 


ACCCTCCTAG 
GTGATACAGG 
GTGAGGAAAT 
GCCTGGGAGG 
GAAAGTTTAA 
ATCCCTGTGA 
CCAGCCTTGT 
ATATTATGGG 
GGGCTCGAGG 
TTGGCGTAAT 
CACAACATAC 
CTCACATTAA 
CTGCATTAAT 
GCTTCCTCGC 
CACTCAAAGG 
TGAGCAAAAG 
CATAGGCTCC 
AACCCGACAG 
CCTGT TCCGA 
GCGCTTTCTC 
CTGGGCTGTG 
CGTCTTGAGT 
AGGATTAGCA 
TACGGCTACA 
GGAAAAAGAG 
TTTGTTTGCA 
TTTTCTACGG 
TGCGAATCGG 
AGCTCTTCAG 
AGCCGGCCAC 
CAGGCATCGC 
GCGAACAGTT 
AGACCGGCTT 
GGGCAGGTAG 
TTCTCGGCAG 
AGCCAGTCCC 
GTGGCCAGCC 
TCGGTCTTGA 
GAGCAGCCGA 
GGAGAACCTG 
TGATCAGATC 
ACTTTGCAGG 
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<210> 
<211> 
<212> 
<400> 


22 
567 

nucleic acid 
22 


ATGGCCTTGA 
TCTGTGGGCT 
CTGGCACAGA 
TTTCCCCAGG 
GAGATGATCC 
5 5 GAGACCCTCC 
TGTGTGATAC 
GCTGTGAGGA 
TGTGCCTGGG 
CAAGAAAGTT 

60 


CCTTTGCTTT 
GTGATCTGCC 
TGAGGAGAAT 
AGGAGTTTGG 
AGCAGATCTT 
TAGACAAATT 
AGGGGGTGGG 
AATACTTCCA 
AGGTTGTCAG 
TAAGAAGTAA 


ACTGGTGGCC 
TCAAACCCAC 
CTCTCTTTTC 
CAACCAGTTC 
CAATCTCTTC 
CTACACTGAA 
GGTGACAGAG 
AAGAATCACT 
AGCAGAAATC 
GGAATGA 


CTCCTGGTGC 
AGCCTGGGTA 
TCCTGCTTGA 
CAAAAGGCTG 
AGCACAAAGG 
CTCTACCAGC 
ACTCCCCTGA 
CTCTATCTGA 
ATGAGATCTT 


TCAGCTGCAA 
GCAGGAGGAC 
AGGACAGACA 
AAACCATCCC 
ACTCATCTGC 
AGCTGAATGA 
TGAAGGAGGA 
AAGAGAAGAA 
TTTCTTTGTC 


GTCAAGCTGC 
CTTGATGCTC 
TGACTTTGGA 
TGTCCTCCAT 
TGCTTGGGAT 
CCTGGAAGCC 
CTCCATTCTG 
ATACAGCCCT 
AACAAACTTG 
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<210> 23 
<211> 16 
<212> nucleic acid 

<223> «y« stands for C or T; »N" stands for any base 
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<210> 24 
10 <211> 660 

<212> nucleic acid 

<223> »Y" stands for C or T; -R» stands for A or G; 

"W" stands for T or A; "S" stands for C or G; "N" 
stands for any base. 
15 <400> 24 


E~ KSS3 T T ^= 

20 TTYTAYCCNT GyIcSc* 2™!*"™ ATGYTNCARA ARGCNMGNCA SyS 

. aiissi » 

ESSS5 ESSES ESSES ™£I S 
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30 <210> 25 

<211> 987 

<212> nucleic acid 

<223> «y» stands for c Qr T; „ R „ stands fQr a ^ Q; fiw> 

stands for T or A; "S" stands for C or G; "N" 

35 stands for any base. 

<400> 25 


- anna 

- SSSSSS 

so sssssa sssss ssjss ssssss ssssE ESS 

ACNTGGWSNA CNCCNCAYWS NTAYTTYWSN YTNACNTTYT GYGTNCARGT 
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WSNAARMGNG ARAARAARGA YMGNGTNTTY 
MGNAARAAYG CNWSNATHWS NGTNMGNGCN 
GARTGGGCNW SNGTNCCNTG YWSNTRR 


ACNGAYAARA CNWSNGCNAC NGTNATHTGY 
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INTERNATIONAL SEARCH REPORT 


Inf -national Application No 

P^i/US 99/05394 


A. CLASSIFICATION OF SUBJECT MATTER * " 

IPC 6 C12N15/21 C07K14/56 C12N15/88 C12N15/24 C07K14/54 
C12N15/85 A61K48/00 C12N5/10 //A61K9/127 


According to international Patent Qassrfication (IPQ or to both national classification and IPC 
B. FIELDS SEARCHED 


Minimum documentation searcned iC^ssDication system followed by classification symbols) 

IPC 6 C07K A61K C12N 


Documentation searched other than m.n.mum Documentation to tne extent that Such documents are included in the lieids s 


Electronic data oase consulted during ma international search (name of data case and where practical search terms used) 


C. DOCUMENTS CONSIDERED TO BE RELEVANT 


Category - Citation ot document, with indication, where appropriate, of the relevant passages 


WO 96 17063 A (VICAL INC) 
6 June 1996 (1996-06-06) 


the whole document especially 

page 46 ; line 1 - line 37; claims; figure 

1 


Relevant to claim No. 


1-10, 

22-30, 

33-36, 

42-45, 

49-66,68 


j X | Furtner documents are listed in the continuation ot box C 


0 


Patent family members are listed in annex. 


* Special categories of cited documents : 

"A" document defining the general state of the art which is not 

considered to be ol particular relevance 
"E" earlier document but published on or after the international 

filing date 

"L" document which may throw doubts on priority ciaim(s) or 
which is cited to establish the publication date ol another 
citation or other special reason (as specified ) 

"O" document referring to an oral disclosure, use. exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority oate claimed 


Date of (he actual completion ol tne international search 

12 October 1999 


T" later document published after the international filing date 
or priority dale and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X" document of particular relevance: the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document ol particular relevance: the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious lo a person skilled 
in the art. 

document member of the same patent family 


Name and mailing address ot the ISA 

European Patent Office. P 6 5618 Patentlaan 2 
ML - 2280 HV R.rswijk 
Tel (*3l-70) 3^0-2040. T* 3t65ieponl. 
Fax: ( *3i-70) 3*0-3016 


Porm PCT.*l5A/?i0 (second snaen (July 1992) 


Date of mailing ol the international search report 


21/10/1999 


Authorized officer 


Le Cornec , N 


page 1 of 4 


INTERNATIONAL SEARCH REPORT 


tnf -national Application No 

Pv,./US 99/05394 


C.(Continuation> DOCUMENTS CONSIDERED TO BE RELEVANT 


Category i Citation ol document, wrtn <ndicatioa where appropriate, ol the relevant passages 


BELLDEGRUN A ET AL: "HUMAN RENAL 
CARCINOMA LINE TRANSFECTED WITH 
INTERLEUKIN-2 AND/OR INTERFERON ALPHA 
GENE(S): IMPLICATIONS FOR LIVE CANCER 
VACCINES" 

JOURNAL OF THE NATIONAL CANCER INSTITUTE , 
vol . 85, no. 3, 

3 February 1993 (1993-02-03), pages 
207-216. XP002057839 

ISSN: 0027-8874 
cited in the application 
the whole document 

RUSSELL J. MUMPER ET AL: "Polyvinyl 

derivatives as novel interactive polymers 

for controlled Gene delivery to muscle" 

PHARMACEUTICAL RESEARCH, 

vol. 13, no. 5, May 1996 (1996-05), pages 

701-709, XP002118167 

the whole document 

H. ALILA ET AL : "Expression of 
biologically active human Insulin-like 
growth factor- I following intramuscular 
injection of a formulated plasmid in rats" 
HUMAN GENE THERAPY , 
vol . 8 . no . 15 , 

10 October 1997 (1997-10-10), pages 
1785-1795, XP002118452 
cited in the application 
the whole document 

W0 97 33998 A (CHIOU HENRY C ;CARLO DENNIS 
0 (US); IMMUNE RESPONSE CORP INC (US)) 
18 September 1997 (1997-09-18) 

page 16, line 30 - line 35; example 1 

R.J. MUMPER ET AL: "protective 
interactive noncondensing (PINO polymers 
for enhanced plasmid distribution and 
expression in rat skeletal muscle" 
JOURNAL OF CONTROLLED RELEASE, 
vol. 52, 2 March 1998 (1998-03-02), pages 
191-203, XP004113667 
the whole document 

-/-- 


1-10, 

22-30, 

33-36, 

42-45, 

49-66,68 


1-10, 

22-30, 

33-36, 

42-45, 

49-66,68 


1-10, 

22-30, 

33-36, 

42-45, 

49-66,68 


1-10, 
22-30, 
42-45, 
49-66,68 


1-10, 

22-30, 

33-36, 

42-45, 

49-66,68 


Forni PCf-'lSA/2lO (conxooatioo ot second sfteei) Uiiy 1992) 


page 2 of 4 


INTERNATIONAL SEARCH REPORT 


)nt*—»ational Application No 

Put /US 99/05394 


C. (Continuation) OOCUMENTS CONSIDERED TO BE RELEVANT 


Citation oi document, with .ndicatton. where appropriate, of the relevant passages 


Relevant to clatm No. 


FERRANTINI M ET AL: "ALPHAl -INTERFERON 
GENE TRANSFER INTO METASTATIC FRIEND 
LEUKEMIA CELLS ABROGATED TUMORIGENICITY IN 
IMMUNOCOMPETENT MICE: ANTITUMOR THERAPY BY 
MEANS OF INTERFERON-PRODUCING CELLS" 
CANCER RESEARCH, 

vol. 53, 1 March 1993 (1993-03-01), pages 
1107-1112, XP002015124 

ISSN: 0008-5472 
cited in the appl ication 
the whole document 

WO 97 00085 A (UNIV MEDICINE & DENTISTRY 
OF N) 3 January 1997 (1997-01-03) 
cited in the application 


page 41 -page 52; claims; example 2 

W0 96 21470 A (R0LLAND ALAIN ; GENEMEDICINE 
INC (US); MUMPER RUSSELL J (US)) 
18 July 1996 (1996-07-18) 
cited in the application 

page 14 -page 15, line 10; claims; 
examples 1,4 

WO 97 00321 A (WOOD PAUL ; SEOW HENG FONG 
(AU); COMMW SCIENT IND RES ORG (AU)) 
3 January 1997 (1997-01-03) 
page 31, line 11 -page 32, line 7; claims 

M. FERRANTINI ET AL: "IFN-alphal gene 

expression into a metastatic murine 

adenocarcinoma (TS/A) results in CD8+ T 

cell-mediated tumor rejection and 

development of antitumor immunity" 

JOURNAL OF IMMUNOLOGY, 

vol. 153, 1994, pages 4604-4615 

XP002118168 

the whole document 

GAO X ET AL: "CATIONIC LIPOSOME-MEDIATED 
GENE TRANSFER" 
GENE THERAPY, 
vol. 2, no. 10, 

1 December 1995 (1995-12-01), pages 
710-722, XP000749400 
ISSN: 0969-7128 


1-10. 

22-30, 

33-36. 

42-45. 

49-66.68 


1-10, 

22-30, 

33-36, 

42-45, 

49-66,68 


1-10, 

22-30, 

33-36, 

42-45, 

49-66,68 


11-21 


1-69 


-/-- 


Fofin PCniSA^iO(conanuatioool socond srweiMJuty 19S2( 


page 3 of 4 


INTERNATIONAL SEARCH REPORT 


lntf»--*alional Application No 

P^./US 99/05394 


C(Continuatton) DOCUMENTS CONSIDERED TO BE RELEVANT 


Category I Citation oi document, with mdtcation.wnere appropriate, ot the relevant passages 


P,X 


P,A 


P,X 


P. A 


H. COLEMAN ET AL: "Nonviral Interferon 
alpha Gene therapy inhibits growth of 
established tumors by eliciting a systemic 
immune response" 
HUMAN GENE THERAPY , 
vol 9, 10 October 1998 (1998-10-10), 
pages 2223-2230, XP002118169 
the whole document 


W0 98 34952 A (GENEMEDICINE INC) 
13 August 1998 (1998-08-13) 


The whole document especially 

page 11, line 8 - line 33; claims; figure 

2 

page 47, line I -page 49, line 23 

WO 98 17689 A (DESHPANDE DEEPA ;FREIMARK 
BRUCE (US); NORDSTROM JEFF (US); GENEMED) 
30 April 1998 (1998-04-30) 
cited in the appl ication 


the whole document 


SK MENDIRATTA ET AL: " Intratumoral 
delivery of iL-12 gene by polyvinyl 
polymeric vector system to murine renal 
and colon carcinoma results in potent 
antitumor immunity" 
GENE THERAPY , 

vol. 6, no. 5, May 1999 (1999-05), pages 
83-839, XP002118170 


1-8, 
22-30, 
43-45, 
50-65 


40,42, 
46,47, 
49,66-69 

1-8, 
22-40, 
42-46, 
49-65 t 
68,69 


I- 8, 
30-39, 
42,43, 
46, 

49-62, 
65,68,69 

II- 21, 
40,41, 
44,45, 
47,48, 
63,66,67 


Form PCT/ISA/210 (contnxianon ot second snael) (July 1 992 i 


page 4 of 4 


INTERNATIONAL SEARCH REPORT 


of national appuca'ion No. 

PCT/US 99/05394 


Box I Observations where certain claims were found unsearchable (Continuation of item 1 of first sheet) 


This mternanonal Search Report has not been established in respect of certain cla.ms under Article ^ 7(2 K a) (or the blowing reasc 
1 \T\ Claims Nos.: 

because ihey relate to sub t ect matter not required to be searched by this Authority, namely: 

Remark: Although claims 50-55, 59-61, 63-69 

are directed to a method of treatment of the human/animal 
body (rule 39.1 (IV) PCT, the search has been carried out and 
based on the alleged effects of the compound/composition. 

2. Claims Nos.: 

5S?^!! e » t 5f y / ela,e '° panS °' the ,n,ernational Application that do not comply with the prescribed requirements to such 
an extent that no meaningful international Search can be earned out. specifically: requirements to such 


3. 1 | Claims Nos.: 

because they are dependent cia.ms and are not drafied in accordance with the second and third sentences of Rule 6.4(a). 


Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 


This International Search.ng Author.ty found multiple inventions in this international application, as follows: 


1 ' Q se 5 arc l h r ableclaim < s d,, ' 0na ' ^ t,me ' V ^ bV thS applicant ' ,h)S International Search Report covers all 

2 ' ^ ^y^^t^" 1 " 3 C ° U,d ^ SearChed w ' thOU ' Btion ius,,f V ,n 9 an addihonal fee. mis Authority d.d not inv.te payment 


3 - Q ^^^^^J^Z^ a h dd ' t, ? na ' search '!f s were time 'V P aid b * ^e applicant. th,s International Search Report 
covers only those claims for which lees were paid, specifically claims Nos.: 


□ 


IlsS^n fht^n! S6a / Ch f6eS W6re " melV Paid by the a PP |( «nt. Consequently, this International Search Report « 
restricted to the invention first mentioned in the claims: it ts covered by claims Nos : 


Remark on Protest j~ | The adtJlll0nal searCn , ees were accompanied by the applicant s protest. 

| | No protest accompanied Ihe payment of additional search fees. 


Form PCT/ISA/210 (continuation of first sneet (1)) (July 1998) 


INTERNATIONAL SEARCH REPORT 

information on patent family members 


lm^~»ational 

P^./US 


application No 

99/05394 


Patent document 
cited in search report 


WO 9617063 


Publication 
date 


Patent family 
member(s) 


06-06-1996 


US 5641665 A 
EP 0795015 A 
Jp 10509877 T 


Publication 
date 


24-06-1997 
17-09-1997 
29-09-1998 


W0 9733998 

A 

18-09- 

1997 

At 1 
HU 

CA 

rp 
tr 

2248538 A 
DQ04373 A 

01-10-1997 
18-09-1997 
31-03-1999 

WO 9700085 

A 

03-01- 

1997 

All 
AU 

EP 

6782896 A 

U lulu J v n 

0835130 A 

15-01-1997 
15-04-1998 

WO 9621470 

A 

18-07- 

1996 

AU 
AU 
CA 
EP 
JP 

703419 B 
4611096 A 
2210132 A 
0794798 A 
11502507 T 

25-03-1999 
31-07-1996 
18-07-1996 
17-09-1997 
02-03-1999 

WO 9700321 

A 

03-01- 

-1997 

AU 

5991796 A 

15-01-1997 

W0 9834952 

A 

13-08 

-1998 

AU 

6269298 A 

26-08-1998 

W0 9817689 

A 

30-04 

-1998 

AU 

5146898 A 

15-05-1998 


Form PCTflSA/210 IpMem lamdy ann«i) IJuiy 1992) 


8NSOOCID: <WO 9947678A3 IA> 


